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Abstract 

One fundamental problem in the field of network coding is to determine the network coding capacity 
of networks under various network coding schemes. In this thesis, we address the problem with two 
approaches: matroidal networks and capacity regions. 

In our matroidal approach, we prove the converse of the theorem which states that, if a network 
is scalar-linearly solvable then it is a matroidal network associated with a representable matroid 
over a finite field. As a consequence, we obtain a correspondence between scalar-linearly solvable 
networks and representable matroids over finite fields in the framework of matroidal networks. 
We prove a theorem about the scalar-linear solvability of networks and field characteristics. We 
provide a method for generating scalar-linearly solvable networks that are potentially different from 
the networks that we already know are scalar-linearly solvable. 

In our capacity region approach, we define a multi-dimensional object, called the network capac- 
ity region, associated with networks that is analogous to the rate regions in information theory. For 
the network routing capacity region, we show that the region is a computable rational polytope and 
provide exact algorithms and approximation heuristics for computing the region. For the network 
linear coding capacity region, we construct a computable rational polytope, with respect to a given 
finite field, that inner bounds the linear coding capacity region and provide exact algorithms and 
approximation heuristics for computing the polytope. The exact algorithms and approximation 
heuristics we present are not polynomial time schemes and may depend on the output size. 
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Chapter 1 



Introduction 



Network coding is a field at the intersection of network information theory and coding theory. 
The central idea of the field of network coding is that increased capabilities of intermediate nodes 
lead to improvements in information throughput of the network. In the traditional routing model 
of information networks, intermediate nodes simply copy and forward incoming packets, and the 
information flows were modeled as source-to-sink paths and Steiner trees. In the network coding 
model, intermediate nodes are now allowed more complicated operations to code on incoming 
packets and forward packets that might differ significantly from the incoming packets. It has been 
shown numerous times that the network coding model allows greater information throughput than 
in the traditional routing model and its applicability has been widely researched. One fundamental 
problem in the field of network coding is to determine the network coding capacity, the maximum 
amount of information throughput, of networks under various network coding schemes. In this 
work, we address the problem with two approaches: matroidal networks and capacity regions. 

1.1 Network Coding Model 

We give a network coding model that we will use in this work. Most of it is adapted from [8]. 
Further additional definitions are relegated to relevant chapters. Throughout the work, we assume 
that the networks are acyclic and the edges (or links) between nodes are delay-free and error-free. 

Definition 1 (Network). A network M is a finite, directed, acyclic multigraph given by a 6-tuple 
(v,e,pt,A,S,R) where 
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1. v is a node set, 

2. e is an edge set, 

3. \x is a message set, 
4- A is an alphabet, 

5. S : v — > 2^ is a source mapping, and 

6. R : v — > 2^ is a receiver mapping. 

We use a pair of nodes (x, y) to denote a directed edge from node x to node y; x is the start 
node and y is the end node. For each node x, if S(x) is nonempty then x is a source and if R(x) 
is nonempty then x is a receiver. The elements of S(x) are called the messages generated by x and 
the elements of R(x) are called the messages demanded by x. An alphabet A is a finite set with at 
least two elements. Each instance of a message is a vector of elements from the alphabet. For each 
node x, let In(x) denote the set of messages generated by x and in-edges of x. Let Out(x) denote 
the set of messages demanded by x and out-edges of x. For each node x, we fix an ordering of In(x) 
and Out(x) such that all messages occur before the edges in the resulting lists. In our definition of 
networks, there could be multiple source nodes and multiple receiver nodes with arbitrary demands. 

There are several special classes of networks: unicast networks where there are exactly one 
message, one source and one receiver; multicast networks where there are exactly one message and 
one source, but an arbitrary number of receivers that demand the message; two-level multicast 
networks where there are multiple messages and there are exactly one source node that generates 
all the network messages and two receivers where one demands all the messages and the other 
demands a subset of the messages; multiple unicast networks where there are multiple messages 
and, for each message m, we have the unicast condition; multiple multicast networks where there 
are multiple messages and, for each message m, we have the multicast condition. A general network 
has multiple source nodes, multiple receiver nodes, and arbitrary demands of messages; they are 
sometimes referred to as multi-source multi-sink networks in literature. A multicast network in 
literature usually has multiple messages, but we restrict multicast networks to those with a single 
message in this work. 
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We define edge function, decoding function, message assignment and symbol function with 
respect to a finite field F of cardinality greater than or equal to |„4|. We choose such F so that 
each element from A can be uniquely represented with an element from F. 

Definition 2 (Edge and Decoding Functions). Let k and n be positive integers. For each edge 
e = (x, y), an edge function is a map 

f e : (F k ) a x {F n f -»• F n , 

where a and (3 are number of messages generated by x and in-edges of x, respectively. For each 
node x G v and message m G R(x), a decoding function is a map 

fx,m ■ (F k ) a X (F n f -> F\ 

where a and (3 are number of messages generated by x and in-edges of x, respectively. We call k 
and n the source dimension and edge dimension, respectively. 

Each source sends a message vector of length k and each edge carries a message vector of 
length n. We denote the collections of edge and decoding functions by F e = {f e : e € e} and 
Fd = {f x ,m ■ x e v,m e R(x)}. 

Definition 3 (Message Assignment). A message assignment is a map a : fi — > F k , i.e., each 
message is assigned with a vector from F k . 

Definition 4 (Symbol Function). A symbol function is a map s : e — > F n defined recursively, with 
respect to M and F e , such that for all e = (x,y) £ e, 

s(e) = / e (a(mi), . . . , a(m a ), s(e a+1 ), s(e Q+/9 )), 

where mi, . . . , m a are the messages generated by x and e Q +i, . . . , e a+ p are the in-edges of x. Note 
that the symbol function is well-defined as network M is a directed acyclic multigraph. 

Definition 5 (Network Code). A network code on N is a 5-tuple (F, k,n, F e , Fa) where 

1. F is a finite field, with \F\ > \A\, 



11 



2. k is a source dimension, 

3. n is an edge dimension, 

4- T e is a set of edge functions on network M, 

5. Td is a set of decoding functions on network M . 

We shall use the prefix (k, n) before codes when we wish to be more specific on parameters k 
and n. When k and n are clear from the context, we will sometimes omit them. There are several 
special classes of network codes: routing network codes, where edge and decoding functions simply 
copy input vector components to output vector components, linear network codes, where edge 
and decoding functions are linear over F, and nonlinear network codes, where edge and decoding 
functions are nonlinear over F. Vector-linear network codes are linear network codes with k = n. 
Scalar-linear network codes are linear network codes with k = n = 1. 

Definition 6 (Network Code Solution). A network code (F, k, n, T e , J-j) is a network code solution, 
or solution for short, if for every message assignment a : fJL — > F k , 

fx, m {a{mi), a(m a ), s(e a+1 ), s{e a+ p)) = a(m), 

for all x G u and m G R{x). Note that m\, . . . , m a are messages generated by x, and e a +i, • • • , e Q +/3 
are in-edges of x. If the above equation holds for a particular node x G v and message m G R(x), 
then we say node x's demand m is satisfied. 

A network N is routing-solvable if it has a routing network code solution. Similarly, we say that 
network is linearly solvable (scalar-linearly solvable, vector-linearly solvable, nonlinearly solvable) 
if it has a linear (scalar-linear, vector-linear, nonlinear) network code solution. 

1.2 Previous Works 

In a seminal work in 2000, Ahlswede et al. [1] introduced the network coding model to the prob- 
lem of communicating information in networks. They showed that the extended capabilities of 
intermediate nodes to code on incoming packets give greater information throughput than in the 
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traditional routing model. They also showed that the capacity of any multiple multicast network 
of a certain class is equal to the minimum of min-cuts between the source node and receiver nodes. 

Single source networks and linear network coding are comparatively well-understood. Li et 
al. [22] showed that linear network coding is sufficient for certain multiple multicast networks. 
Koetter and Medard |20j reduced the problem of determining scalar-linear solvability to solving a 
set of polynomial equations over some finite field and suggested connections between scalar-linearly 
solvable networks and nonempty varieties in algebraic geometry. They showed that scalar-linear 
solvability of many special case networks, such as two-level multicasts, can be determined by their 
method. Dougherty et al. [9] strengthened the connection by demonstrating solvably equivalent 
pairs of networks and polynomial collections; for any polynomial collection, there exists a network 
that is scalar-linearly solvable over field F if and only if the polynomial collection is solvable over 
F. It is known that scalar-linear network codes are not sufficient in general. The M-network 
due to Koetter in [23] is a network with no scalar-linear solution but has a vector-linear solution. 
Lehman and Lehman [21] using 3-CNF formulas also provided an example where a vector solution 
is necessary. 

More recently, matroidal approaches to analyze networks have been quite successful. Dougherty 
et al. [8] defined and studied matroidal networks and suggested connections between networks 
and matroids. They used matroidal networks constructed from well-known matroids to show in 
[5] that not all solvable networks have a linear solution over some finite-field alphabet and vector 
dimension. They also constructed a matroidal network to show that Shannon-type information 
inequalities are not sufficient for computing network coding capacities in general. Recently, El 
Rouayheb et al. |1U] strengthened the connection between networks and matroids by constructing 
"solvably equivalent" pairs of networks and matroids via index codes with their own construction 
method; the network has a vector-linear solution over a field if and only if the matroid has a 
multilinear representation over the same field. In another recent work [25J, Sun et al. studied 
the matroid structure of single-source networks which they define as network matroid and showed 
connections between the network matroids and a special class of linear network codes. 

The capacity regions of networks are less well-understood, but a few explicit outer bounds of 
capacity regions of networks exist. One easy set of outer bounds is the max-flow/min-cut bounds, 
which were sufficient in the case of certain multiple multicast networks. Harvey et al.[16] combined 
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information theoretic and graph theoretic techniques to provide a computable outer bound on 
the network coding capacity regions of networks. Yan et al.[27j gave an explicit outer bound for 
networks that improved upon the max-flow/min-cut outer bound and showed its connection to a 
kind of minimum cost network coding problem. They used their results to compute the capacity 
region of a special class of 3-layer networks. Thakor et al.[26j gave a new computable outer bound, 
based on characterizations of all functional dependencies in networks, that is provably tighter than 
those given in [16] and [27]. 

Recently, explicit characterizations of capacity regions, albeit hard to compute, of networks were 
given using information theoretic approaches. Yan et al.[28] provided an exact characterization of 
the capacity regions for general multi-source multi-sink networks by bounding the constrained 
regions in the entropy space. However, they noted that explicitly evaluating the obtained capacity 
regions remains difficult in general. In a related work, Chan and Grant [3] showed that even the 
explicit characterization of capacity regions for single-source networks can be difficult since the 
computation of a capacity region reduces to the determination of the nonpolyhedral set of all 
entropy functions and that linear programming bounds do not suffice. 

The routing capacity regions of networks are better understood via linear programming ap- 
proaches. Cannons et al.[2] defined the notion of network routing capacity that is computable with 
a linear program and showed that every rational number in (0, 1] is the routing capacity of some 
solvable network. Yazdi et al. j29l .30] extended a special case of Farkas Lemma called the "Japanese 
Theorem" to reduce an infinite set of linear constraints to a set of finitely many linear constraints 
in terms of minimal Steiner trees and applied the results to obtain the routing capacity region of 
undirected ring networks. In a subsequent work, Kakhbod and Yazdi[l9j provided the complexity 
results on the description size of the finitely many inequalities obtained in [29\ [30] and apply them 
to the undirected ring networks. 

1.3 Our Results 

We organize our contributions into two parts: matroidal networks and network capacity regions. 
In both approaches, we provide examples to demonstrate our main ideas. 
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1.3.1 Matroidal Networks 

In our matroidal approach, we further study the matroidal networks introduced by Dougherty et 
al. [8]. Our contributions can be summarized as follows and we refer to Chapter [2] for details: 

1. We prove the converse of a theorem in [8] which states that, if a network is scalar- linearly 
solvable then it is a matroidal network associated with a representable matroid over a finite 
field. 

2. We prove a theorem about the scalar-linear solvability of networks and field characteristics. 

3. We provide a method for generating scalar- linearly solvable networks that are potentially 
different from the networks that we already know are scalar-linearly solvable. 

As a consequence, we obtain a correspondence between scalar-linearly solvable networks and 
representable matroids over finite fields in the framework of matroidal networks. It also follows 
that determining scalar-linear solvability of a network J\f is equivalent to determining the existence 
of a representable matroid A4 over a finite field and a valid network-matroid mapping between M. 
and M . We obtain a set of scalar-linearly solvable networks that are potentially different from the 
networks that are already known to be scalar-linearly solvable. 

1.3.2 Network Capacity Regions 

In our work concerning the network capacity regions, we continue the research along the lines 
of work by Cannons et al. [2]. Our contributions can be summarized as follows and we refer to 
Chapter [3] for details: 

1. We define the network capacity region of networks and prove its notable properties: closed- 
ness, boundedness and convexity. 

2. We show that the network routing capacity region is a computable rational polytope and 
provide exact algorithms and approximation heuristics for computing the region. 

3. We define the semi-network linear coding capacity region that inner bounds the corresponding 
network linear coding capacity region, show that it is a computable rational polytope and 
provide exact algorithms and approximation heuristics for computing it. 
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While we present our results for the general directed acyclic networks, they generalize to directed 
networks with cycles and undirected networks. We note that the algorithms and heuristics we 
provide do have not polynomial running time in the input size. As our notion of the multi- 
dimensional network capacity region captures the notion of the single-dimensional network capacity 
in [2], our present work, in effect, addresses a few open problems proposed by Cannons et al. [2]: 
whether there exists an efficient algorithm for computing the network routing capacity and whether 
there exists an algorithm for computing the network linear coding capacity. It follows from our 
work that there exist combinatorial approximation algorithms for computing the network routing 
capacity and for computing a lower bound of the network linear coding capacity. 
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Chapter 2 

Matroidal Networks Associated with 
Representable Matroids 

In this chapter, we further study the matroidal networks introduced by Dougherty et al. [8]. We 
prove the converse of a theorem in jHj which states that, if a network is scalar dinearly solvable then 
it is a matroidal network associated with a representable matroid over a finite field. From [8] and 
our present work, it follows that a network is scalardinearly solvable if and only if it is a matroidal 
network associated with a representable matroid over a finite field. The main idea of our work is 
to construct a scalardinear network code from the network-matroid mapping between the matroid 
and network. Thereby, we show a correspondence between scalardinearly solvable networks and 
representable matroids over finite fields in the framework of matroidal networks. It follows that 
determining scalardinear solvability of a network J\f is equivalent to determining the existence of a 
representable matroid Ai over a finite field and a valid network-matroid mapping between A4 and 
N . We also prove a theorem about the scalardinear solvability of networks and field characteristics. 
Using our result and the matroidal network construction method due to Dougherty et al., we note 
that networks constructed from representable matroids over finite fields are scalardinearly solvable. 
The constructed networks are potentially different from the classes of networks that are already 
known to be scalardinearly solvable. It is possible that our approach provides a superset, but this 
is unknown at this time. 
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2.1 Definitions 



Definition 7 (Global Linear Network Code). A global linear network code is a 5-tuple (F, k, n, 

(fiedge) where 

1. F is a finite field, with \F\ > \A\, 

2. k is a source dimension, 

3. n is an edge dimension, 

4- 4>msg is the global coding vector function on messages, 4>msg '■ M ~~ > (F kxk )\^, such that for 
message m, </> ms9 (m) = (Mi, . . . , M\^) T where Mi is a k x k matrix over F , and 

5- 4>edge is the global coding vector function on edges, (f) e dge '■ £ (F nxk )\^, such that for each 
edge e, 4> e dge(e) = (Mi, . . . , M\^) T where Mi is a n x k matrix over F. 

Definition 8 (Global Linear Network Code Solution). A global linear network code (F, k, n, 
4>msg, <t>edge) is a global linear network code solution, if \F\ > \A\ and the following conditions are 
satisfied: 

1. For each message m G /i, (f>msg 

(m) = (0, . . . , 0,I hxk , 0, . . . , 0) T where I kxh is thekxk identity 
matrix over F and is in the coordinate corresponding to message m. 

2. For each node x G v and edge e € Out(x), if <fi e dge(e) = (M\, . . . , M| /J |) T ; then there exist 
matrices C\, . . . , C a+ p over F such that Mi = Y2°j=i CjMj , for i = 1, . . . , 

3. For each node x G v and message to G Out(x), if 4>m Sg {m) = (Mi, . . . , M| M |) T , then there 
exist matrices C[, . . . , C' a+j3 over F such that Mi = Y^j=\ CjM? , for i = 1, . . . 

Where, if mi, . . . , m a are messages generated by x and e a+ i, . . . , e a+ p are in-edges of x, msff (mj) = 
(M(, . . . , Mj^|) T for j = 1, . . . , a and cPedgeiej) = (M(, . . . , Mfa) T for j = a + 1, . . . , a + /3; 
Ci, . . . ,C a are n x k matrices and C Q +i, . . . , C a+ p are n x n matrices that would appear as co- 
efficients in a linear edge function; and C[, . . . ,C' a are k x k matrices and C' a+1 , . . . , C' a+ ^ are 
k x n matrices that would appear as coefficients in a linear decoding function. 

As with the network codes, we shall sometimes use the prefix (k, n) to emphasize the source 
and edge dimensions or omit k and n if they are clear from the context. It is straightforward 
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to check that the notions of linear network code solution and global linear network code solution 
are equivalent, as noted in previous works in algebraic network coding (for instance, [20] for the 
k = n = 1 case). 

Proposition 9. Let N = (is, e, /i, A, S, R) be a network. Then, J\f has a (k, n) linear network code 
solution if and only if it has a (k, n) global linear network code solution. 

Proof. Let (F, k, n, T e , J-j) be a (k,n) linear network code solution for jV. Since M is a directed 
acyclic graph, we can order nodes in v with a topological sort so that each edge go from a lower- 
ranked node to a higher-ranked node. The ordering of nodes induces an ordering of edges ei, . . . , ei e i 
such that no path exists from ei to ij for i > j. We define 4> msg for all m and 4> e d ge for ei, . . . , ei e i 
in that order: 

1. For each message m, we define (ftmsg (m) = (0, . . . , 0, I kxk , 0, . . . , 0) T where I kxk is the k x k 
identity matrix and is in the coordinate corresponding to m. 

2. For each ej = (x,y), the edge function /g. can be written as 

a a+/3 

/^(a(mi), . . . ,a(m a ),s(e a +i), • • • , s(e a+/S )) =y^C r a(m/) + ^ Q ■ s(e ; ), 

1=1 l=a+l 

where mi, . . . , m a are messages generated by x and e a+ \, . . . ,e a +p are in-edges of x; and 
Ci, . . . , C Q are nxk matrices and C a +i, ■ ■ ■ , Ca+p are nxn matrices over F. Let <fimsg(mj) = 
(Ml Mfa) T for j = 1, . . . , a and </w 5e ( ej ) = (M{ , . . . , M^) T for j = a + 1, . . . , a + /3. 
We define ^ 9e (e,) = (Mi, . . . , M H ) T where Mj = Q ■ M\. 

Note that s(ej) = Yl\=i Mi • a(rrii). By construction, (F, k, n, 4> mS g, 4>edge) is a valid (k, n) global 
linear code that satisfies the first two properties of global linear network code solutions. We check 
the third property. For each x £ v and m G R(x), the decoding function f x ,m can be written as 

a a+f) 

fx,m(a(mx), a(m a ), s(e a+1 ), s(e a+ p)) = ^ Q • a(m{) + ^ Q ■ s(e t ), 

1=1 l=a+l 

and f XjTn (a(mi), . . . ,a(m a ), s(e a+ i), . . . , s(e a+l 3)) = a(m). Note that mi,...,m Q are messages 
generated at x and e^+i, . . . , e a+ p are in-edges of x; Ci, . . . , C a are k x k matrices and C Q +i, . . ., 
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C a+ p are k x n matrices. Let (frmsgimj) = (M(, . . . , M^)' 1 ' for j = l,...,a and 4> e dge(ej) = 
(M( , M^) T for j = a + 1, . . . , a + p. It follows that [<t> m3g {m)]i = Y%=\ Q ■ M \ for all i, where 
[(f>msg('m)]i denotes the i-th coordinate of 4>msg(fn)', (F,k,n,<j) ms g,(f) ec i g e) is a (k,n) global linear 
network code solution. 

The converse direction is similar and so we only sketch the proof. Let (F, k, n, <j>m Sg , 4> e dge) be a 
(k,n) global linear network code solution for J\f. For each edge e, we define edge function f e by 

a a+/3 

f e (a(m 1 ),...,a(m a ),s(e a+1 ),...,s(e a+l3 )) = ^Ci-a(rni)+ ^ Q ■ s(e t ), 

1=1 l=a+l 

where C\, . . . ,C a +/3 are some matrices satisfying Definition [8j For each x G v and m G R(x), we 
define decoding function / x>m similarly using matrices C%, . . . , C a+ p from Definition [8j □ 

We have the following corollaries from the definitions: 

Corollary 10. Let M = (V, e, [i, A, S, R) be a network. Then, J\f has a (k, k) vector-linear network 
code solution if and only if it has a (k, k) global vector-linear network code solution. 

Corollary 11. Let M = (u, e, fi, A, S, R) be a network. Then, M has a scalar-linear network code 
solution if and only if it has a global scalar-linear network code solution. 

In this chapter, we will focus on scalar-linear network codes, that is linear network codes with 
k = n = 1. 

2.2 Matroids 

We define matroids and three classes of matroids. See [23] for more background on matroids. 

Definition 12. A matroid A4 is an ordered pair (S,I) consisting of a set S and a collection Z of 
subsets of S satisfying the following conditions: 

1. (Del; 

2. Ifl€l and I' C I, then I' G X; 

3. If I\ and I2 are in T and \I\\ < I-Z2I, then there is an element e of I2 \I\ such that I\ U{e} G X. 
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The set S is called the ground set of the matroid M. A subset X of S is an independent set if it 
is in X; X is a dependent set if not. A frase of is a maximal independent set; for all elements 
e £ S \ B, B (J {e} ^ X. It can be shown that all bases have the same cardinality. A circuit of M 
is a minimal dependent set; for all elements e in C, C \ {e} £ X. For each matroid, there is an 
associated function r called rank that maps the power set 2 into the set of nonnegative integers. 
The rank of a set X C S is the maximum cardinality of an independent set contained in X. 

Definition 13 (Matroid Isomorphism). Two matroids Mi = (Si,l±) and M2 = (£27X2) are 
isomorphic if there is a bijection map ip from S± to S% such that for all X C S±, X is independent 
in M-i if and only if ip(X) is independent in Ai 2 . 

Definition 14 (Uniform Matroids). Let c,d be nonnegative integers such that c < d. Let S be a 
d-element set and X be the collection {X C S : \X\ < c}. We define the uniform matroid of rank 
c on the d-element set to be U c ^ = (5,X). 

Definition 15 (Graphic Matroids). Let G be an undirected graph with the set of edges, S. Let 
X = {X C S : X does not contain a cycle}. We define the graphic matroid associated with G as 
M(G) = (S,X). 

Definition 16 (Representable/Vector Matroid). Let A be a d\ x d 2 matrix over some field F. Let 
S = {1, . . . , efe} where element i in S corresponds to the ith column vector of A and X = {X C S : 
corresponding column vectors form an independent set}. We define the vector matroid associated 
with A as Ai(A) = (S,I). A matroid M. is F-representable if it is isomorphic to a vector matroid 
of some matrix over field F. A matroid is representable if it is representable over some field. Note 
that F is not necessarily finite. 

The bases of U Ct d = (5,X) are exactly subsets of S of cardinality c and the circuits are subsets of 
S of cardinality c+1. Each base of M(G) is a spanning forest of G, hence an union of spanning trees 
in connected components of G, and each circuit is a single cycle within a connected component. 
It is known that the graphic matroids are representable over any field F. On the other hand, the 
uniform matroid C/2,4 is not representable over GF{2). 
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2.3 Matroidal Networks 

We define matroidal networks and present a method for constructing matroidal networks from 
matroids; for more details and relevant results, we refer to [8]. 

Definition 17. Let N be a network with message set \x, node set v , and edge set e. Let M = (S,I) 
be a matroid with rank function r. The network N is a matroidal network associated with A4 if 
there exists a function f : fi U e — > S, called the network-matroid mapping, such that the following 
conditions are satisfied: 

1. f is one-to-one on \i; 

2. /(m) g I; 

3. r(/(In(x))) = r(/(In(x) U Out(x))), for every x £ v. 
We define f(A) to be {f(x) \ x £ A} for a subset A of n U e. 

Theorem 18 (Construction Method). Let M. = (S,I) be a matroid with rank function r. Let M 
denote the network to be constructed, its message set, v its node set, and e its edge set. Then, 
the following construction method will construct a matroidal network M associated with A4. We do 
not address issues of complexity of the method. 

We choose the alphabet A to be any set with at least two elements. The construction will 
simultaneously construct the network N , the network-matroid mapping f : \i U e — > S, and an 
auxiliary function g : S — ^ v, where for each x G S, g(x) is either 

1. a source node with message m and f(m) = x; or 

2. a node with in-degree 1 and whose in-edge e satisfies /(e) = x. 

The construction is completed in 4 steps and each step can be completed in potentially many different 
ways: 

Step 1: Choose any base B = {b\, . . . ,b r rg\} of M. Create network source nodes n\, . . . ,n r ^ and 
corresponding messages mi, . . . ,m r fs)> one a ^ each node. Let f{rrii) = bi and g{bi) = n{. 
Step 2: (to be repeated until no longer possible). 

Find a circuit {xq, . . . , Xj} in M such that g{x\), . . . , g(xj) have been already defined but not g(xo). 
Then we add: 
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1. a new node y and edges e±, . . . , e* such that ei connects g(xi) to y. Let /(e^) = X{. 

2. a new node no with a single in-edge eo that connects y to no- Let /(eo) = xq and g(xo) = no- 
Step 3: (can be repeated arbitrarily many times). 

If {xq, ■ ■ ■ , Xj} is a circuit of Ai and g(xo) is a source node with message mo, then add to the 
network a new receiver node y which demands the message mo and has in-edges ei, . . . , ej where e, 
connects g(xi) to y. Let /(ej) = Xj. 
Step 4-' (can be repeated arbitrarily many times). 

Choose a base B = {x\, . . . ,x r rg\} of Ai and create a receiver node y that demands all the network 
messages and has in-edges ei, . . . , e r rs) where e-i connects g(xi) to y. Let /(e^) = Xj. 

The following theorem is from [5]. The original theorem states with a representable matroid, 
but the same proof still works with a representable matroid over a finite field. 

Theorem 19. If a network is scalar-linearly solvable over some finite field, then the network is 
matroidal. Furthermore, the network is associated with a representable matroid over a finite field. 



2.4 Scalar- linear Solvability 



We prove the converse of Theorem 19 and that a network is scalar-linearly solvable over a finite field 
of characteristic p if and only if the network is a matroidal network associated with a representable 
matroid over a finite field of characteristic p. In what follows, we assume that c?2 > d\. 

Lemma 20. Let A be a d± x di matrix over a finite field F and M(A) be the corresponding 
representable matroid. Then, there exists an arbitrarily large finite field F' and a d\ x di matrix 
A 1 over F' such that the corresponding matroid Ai (A') is isomorphic to Ai (A) . 

Proof. We show that any finite field F' that contains F as a subfield works; for instance, extension 
fields of F. We consider the same matrix A over F' , so choose A' = A, and show that a set of 
column vectors of A is independent over F if and only if it is independent over F' . Assume columns 
vi, . . . ,Vk are dependent by some scalars a^s in F, a\V\ + • • • + atVk = 0. Since F' contains F, all 
operations with elements of the subfield F stay in the subfield, and the same scalars still work in 
F' , i.e., a\V\ + • • • + a^v^ = in F' . Hence, the vectors are dependent over F' . Assume column 
vectors v\,. . . ,Vk are independent over F. We extend the set of vectors to a basis of F dl . Then, 
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the matrix formed by the basis has a nonzero determinant over F. By similar reasons as before, the 
same matrix has a nonzero determinant when considered as a matrix over F' . Hence, the column 
vectors of the basis matrix are independent over F' and, in particular, the column vectors v\ , . . . , Vk 
are independent over F' . □ 

Theorem 21. If a network M is matroidal and is associated with a representable matroid over a 
finite field F , then N is scalar-linearly solvable. 

Proof. Let N = {y, e, /x, A, S, R) be a matroidal network. Let A be the d% x c?2 matrix over the finite 
field F such that J\f is a matroidal network associated with the corresponding matroid Ai(A) = 
(S,T). By Lemma [2Q| we assume that the finite field F is large enough to represent all elements 



in A, i.e., \F\ > |^4|. By Definition 17, there exists a network-matroid mapping / : \i U e — > S. 
Assume r(S) = d\\ otherwise, we remove redundant rows without changing the structure of the 
matroid. Let /(//) = {ii, ■ ■ ■ , iui}- As /(/i) E X, the columns indexed by /(//) form an independent 
set. We extend f(pi) to a basis B of F dl , if necessary, by adding column vectors of A. Without loss 
of generality, assume the first d\ columns of A form the basis B after reordering. By performing 
elementary row operations, we uniquely express A in the form 

A = [I dl | A'] 

where A' is a d\ x (d% — d\) matrix and such that {i\, . . . now corresponds to the first 

columns of A. Note that the structure of the corresponding matroid stays the same. We introduce 
dummy messages muni, . . . , m^, if necessary, by adding a disconnected node that generates these 
messages. We assign global coding vectors on the resulting M as follows: 

1. for each edge e, let (pedge(^) = Afi e y : and 

2. for each message m, let cj> msg (m) = Afr m \, 

where Ai denotes the i-th column of A. We show that the global linear network code defined above 
is valid and satisfies all the demands. For each node x £ we have r(/(In(x))) = r(/(In(x) U 
Out(x))). It follows that for each edge e G Out(rr), Afr e \ is a linear combination of {Afr e i\ : 
e' G In(x)}. Equivalently, (f> e dge(e) is a linear combination of coding vectors in {0 msg (m) : m G 
In(x)} U {4>edge(e) ■ & G In(x)}. For each message m G Out (a;), Afr m \ is a linear combination of 
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{Af( e i^ : e' G In(x)}. Similarly, ms5 (m) is a linear combination of coding vectors in {4> m sg( m ) '■ 
m 6 In(x)} U {4> e dge(e) ■ e 6 In(x)}. Note, furthermore, that 4>m Sg (m) is the standard basis vector 
corresponding torn. It follows that the global linear network code (F, F e ,Fd) thus defined is a global 
linear network code solution. Removing the dummy messages, it follows that TV is scalar-linearly 
solvable. □ 

Given an arbitrary matrix A, assigning its column vectors as global coding vectors will not 
give a global linear network code solution necessarily. In essence, the theorem shows that, while 
we cannot use column vectors of A directly, we can do the described operations to produce an 
equivalent representation of A from which we can derive a global linear network code solution. 



From Theorems 18 and 21 we obtain a method for constructing scalar-linearly solvable networks: 



pick any representable matroid over a finite field F and construct a matroidal network N using 



Theorem 18 Combining Theorems 19 and 21 , we obtain the following theorem. 



Theorem 22. A network is scalar-linearly solvable if and only if the network is a matroidal network 
associated with a representable matroid over a finite field. 

One implication of the theorem is that the class of scalar-linearly solvable networks in the 
algebraic network coding problem corresponds to the class of representable matroids over finite fields 
in the framework of matroidal networks. In effect, our results show a connection between scalar- 
linearly solvable networks, which are tractable networks for network coding, and representable 
matroids over finite fields, which are also particularly tractable in terms of description size. 

In light of Dougherty et al.'s approach [TIE], relationships between field characteristics and linear 
solvability of matroidal networks are important. In the case of scalar-linear network codes, we fully 
characterize a relationship with the following theorem. Note that a network might be a matroidal 
network with respect to more than one representable matroids of different field characteristics and, 
thus, is possibly scalar-linearly solvable with respect to fields of different characteristics. 

Theorem 23. A network is scalar-linearly solvable over a finite field of characteristic p if and only 
if the network is a matroidal network associated with a representable matroid over a finite field of 
characteristic p. 



Proof. We extend Theorems n9\ and 21 and Lemma 20 to include field characteristic p, and the 



statement follows straightforwardly. □ 
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Corollary 24. Any matroidal network M associated with an uniform matroid is scalar -linearly 
solvable over a sufficiently large finite field of any characteristic. The same holds for the graphic 
matroids. 

Proof. It is straightforward to show that for any uniform matroid M and a prime p, there is a 
sufficiently large finite field F of characteristic p and a matrix A such that A4 is a representable 
matroid associated with A over F. The same is true for graphic matroids. □ 

As a consequence, any matroidal networks constructed from uniform or graphic matroids will 
not have interesting properties like those constructed from the Fano and non-Fano matroids in 
Dougherty et al. 018]. 



2.5 Examples 



In this section, we provide examples of scalar-linearly solvable networks that follow from Theorem 
21 As mentioned before, we get a method for constructing scalar-linearly solvable networks from 



Theorems 18 and|21| pick any representable matroid over a finite field F and construct a matroidal 
network. We assume A = {0, 1} throughout this section. 



The Butterfly network N\ in Fig. 2-1 is a matroidal network that can be constructed from 
the uniform matroid f/2,3- The ground set S of £/2,3 is {a, 6, c}. Nodes 1-2 are the source nodes 



and nodes 5-6 are the receiver nodes. See Fig. 2-1 and Table |2.1| for details of the construction 
and a global scalar-linear network code solution. Note that the sets under 'Variables' column are 
order-sensitive. 



Network A/2 in Fig. 2-2 is a matroidal network constructed from the uniform matroid 1]% 4. The 



ground set S of L/2,4 is {a, b, c, d}. Nodes 1 and 2 are the source nodes and nodes 7-9 are the receiver 

10 12 



nodes, t/2,4 is a representable matroid associated with A 



has a scalar- linear network code solution over F3. See Fig. 2-2 and Table [2T2] for details. 
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over F3 and, hence, it 



Consider the graph G and the matroidal network A3 constructed from M(G) in Fig. 2-3 The 
ground set S of Ai{G) is {1, . . . , 7}, representing the edges of G. Nodes 1-4 are the source nodes 
and nodes 11-13 are the receiver nodes. A4(G) is a representable matroid over field F2 and, by 



Theorem 21 the network has a scalar- linear network code solution over F2, as shown by the global 
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Figure 2-1: The Butterfly network A/"i 
constructed from C/2,3 




Figure 2-2: Network A2 constructed 
from U2 4 
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Table 2.1: Construction of the Butterfly net- 
work Ai from U2 3- 
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Table 2.2: Construction of A/2 from U2 4 




Figure 2-3: Graph G and network A3 con- 
structed from M.(G) 
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Table 2.3: Construction of A/3 from M{G) 



in Fig. 2-3 



coding vectors on A3 in Fig. 2-3 This example shows that our results provide networks which 
are different from the networks previously known to be scalar- linearly solvable such as multicast, 
2-level multicast and disjoint multicast networks. It is possible that network A3 can be constructed 
from a set of polynomials as in Dougherty et al. [9] or via index codes as in El Rouayheb et al. |10j . 
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2.6 Discussion and Conclusion 



In this chapter, we showed that any matroidal network associated with a representable matroid 
over a finite field is scalar-linearly solvable. Combined with an earlier result of Dougherty et al., it 
follows that a network is scalar-linearly solvable if and only if it is a matroidal network associated 
with a representable matroid over a finite field. It also follows that determining scalar-linear 
solvability of a network is equivalent to finding a representable matroid over a finite field and a 
valid network-matroid mapping. We also showed a relationship between scalar-linear solvability of 
networks and field characteristics. Moreover, we obtained a method for generating scalar-linearly 
solvable networks from representable matroids over finite fields and a set of scalar-linearly solvable 
networks that are possibly different from those networks that we already know are scalar-linearly 
solvable. 

Unfortunately, the results presented in this chapter do not seem to generalize to vector-linear 
network coding or more general network coding schemes. The difficulty is that the matroid structure 
requires that a subset of the ground set of a matroid is either independent or dependent, but what 
this corresponds to in vector-linear codes, for instance, is not clear. Instead of vectors over fields, 
we now have vectors over rings (matrices over a field, to be more specific) in vector-linear network 
coding, and we are unaware of suitable matroids on vectors over rings for our purpose. In fact, 
El Rouayheb et al. [10] also made a similar observation and suggested that FD-relations are more 
related to networks than are matroids. 
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Chapter 3 



Network Capacity Regions 



In this chapter, we study the network capacity region of networks along the lines of work by Can- 
nons et al. [2] . We define the network capacity region of networks analogously to the rate regions 
in information theory and show its notable properties: closedness, boundedness and convexity. In 
the case of routing, we prove that the network routing capacity region C r is a computable rational 
polytope and provide exact algorithms and approximation heuristics for computing the region. In 
the case of linear network coding, we define an auxiliary region C[, called the semi-network linear 
coding capacity region, which is a computable rational polytope that inner bounds the network 
linear coding capacity region, and provide exact algorithms and approximation heuristics for com- 
puting C'j. More specifically, we show the network routing capacity region C r (Ci) is an image of a 
higher- dimensional rational polytope under an affine map and consider the computation of C r (C[) as 
the polytope reconstruction problem with a ray oracle. Our results generalize to directed networks 
with cycles and undirected networks. We note that some underlying problems associated with 
the polytope reconstruction problem are NP-hard, such as the minimum cost directed Steiner tree 
problem, and that algorithms and heuristics we present are not polynomial time schemes. Rather, 
the algorithms and heuristics may have exponential running time in the input size, depending on 
the intermediate computations and the resulting output's description size. 

As our notion of the multi-dimensional network capacity region captures the notion of the single- 
dimensional network capacity in [2], our present work, in effect, addresses a few open problems 
proposed by Cannons et al. [2]: whether there exists an efficient algorithm for computing the 
network routing capacity and whether there exists an algorithm for computing the network linear 
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coding capacity. Computing the single-dimensional network capacity is equivalent to computing 
a point on the boundary of the multi-dimensional network capacity region and a ray starting at 
the origin and reduces to solving associated linear programs in the case of the network routing 
capacity and (a lower bound of) network linear coding capacity. It follows from our work that 
there exist combinatorial approximation algorithms for computing the network routing capacity 
and for computing a lower bound of the network linear coding capacity. 

A polytope has two equivalent descriptions; the vertex description in terms of vertices, or 
the extreme points, of the polytope and the hyperplane description in terms of linear inequalities 
defining facets of the polytope. In this work, we do not distinguish the two descriptions and use them 
interchangeably, but we note that converting one description into another can be computationally 
expensive; we use vertex enumeration algorithms to convert a hyperplane description into a vertex 
one and facet enumeration algorithms (essentially, convex hull algorithms) for the conversion in the 
other direction. See [12\ [T^l [T8] for more details on polytopes and relevant algorithms. 

In this chapter, we consider nondegenerate networks where, for each demand of a message at a 
receiver node, there is a path from a source node generating the message to the receiver node and 
where no message is both generated and demanded by the same node. 

3.1 Fractional Network Coding Model 

We define a fractional network coding model. Most definitions are adapted from Cannons et al.|2j. 
We use the notations where we write vectors or points in a multi-dimensional space with a hat as 
in k and let fcj denote the i-th coordinate of the vector k. We also use [k]i and k(i) to denote the 
i-th coordinate to avoid confusions when necessary. When it is clear from the context, we omit the 
hat to avoid cluttering symbols. 

Definition 25 (Capacitated Network). A capacitated network N is a finite, directed, acyclic 
multigraph given by a 7 -tuple (u,e, n, c,A, S, R) where 

1. v is a node set, 

2. e is an edge set, 

3. ii is a message set, 
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4- c : e — > Z + is an edge capacity function, 

5. A is an alphabet, 

6. S : v — > 2 M is a source mapping, and 

7. R : v — > 2^ is a receiver mapping. 

As we shall use only capacitated networks in this chapter, we use M to denote a capacitated 
network. We refer to networks defined in Chapter [T] as ordinary networks. We assume that the 
messages in [i are indexed as mi, . . . , m^i. We define fractional edge function, fractional decoding 
function, and fractional message assignment with respect to a finite field F, where \F\ > \A\, a 
source dimension vector k, and an edge dimension n: 

Definition 26 (Fractional Edge and Fractional Decoding Functions). Let N = (v,e, fi,c,A, S,R) 
be a capacitated network and mi,...,mu be the messages. Let k = (ki, . . . , fcui) be a vector of 
positive integers and n be a positive integer. For each edge e = (x,y), a fractional edge function is 
a map 

/ e : (F *i ) X • • • X 

where , • • • , rrii a are a messages generated by x and e a +i, . . . , e a +/3 are f3 in-edges of x. For each 
node x € v and message mj S R(x), a fractional decoding function is a map 

fx,mj ■ (F k ^) x • • • x (F k ^) x (i? nc ( e -+!)) x • • • x (F nc(e «+' 3) ) -> F k \ 

where m^, . . . ,mi a are a messages generated by x and e a+ \, . . . ,e a+l 3 are (3 in-edges of x. 

We call k = (k\ , . . . , fci^i ) the source dimension vector, where ki is the source dimension for 
message mi, and n the edge dimension. We denote the collections of fractional edge and fractional 
decoding functions by F e = {f e : e G e} and F x>m = {f x ,m '■ x G u, m £ R(x)}, respectively. 

Definition 27 (Fractional Message Assignment). Let N = (v,e,f/,,c,A,S,R) be a capacitated 
network and mi, . . . , m^i be the messages. A fractional message assignment is a collection of maps 
a = (ai, . . . , oli) where ai is a message assignment for rrii, a\ : mj — > F ki . 

Definition 28 (Fractional Network Code). Let N = (u,e, fj,,c,A, S, R) be a capacitated network 
and mi, . . . , be the messages in fx. A fractional network code on M is a 5-tuple (F, k, n, J- e , Fd) 
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where 

1. F is a finite field, with \F\ > \A\, 

2. k = (ki, . . . , feui) is a source dimension vector, 

3. n is an edge dimension, 

4- F e is a collection of fractional edge functions on M , 
5. J~d is a collection of fractional decoding functions on M . 

As with the ordinary network codes in Chapter [TJ we have different kinds of fractional network 
codes defined analogously: fractional routing network codes, fractional linear network codes, and 
fractional nonlinear network codes. We shall use the prefix (k, n) before codes to emphasize the 
parameters k and n. 

Definition 29 (Fractional Network Code Solution). Let M = (y, e, fx, c, A, S, R) be a capacitated 
network and mi, . . . , mui be the messages. A fractional network code (F, k, n, F e , Fd) is a fractional 
network code solution, or fractional solution for short, if for every fractional message assignment 
a = (ai, . . . , a| M |), 

fx,m j {P'i 1 {'mi l ), . . . ,a ia (m ia ),s(e a+ i), s(e a+/S )) = aj(rrij), 

for all x G v and rrij G R(x) . Note that , . . . , mi a are a messages generated by x and 
e a +i, • • • , e a +/3 are (5 in-edges of x. If the above equation holds for a particular x G v and message 
m G R(x), then we say node x's demand m is satisfied. 

As with network code solutions for ordinary networks, we have special classes of fractional 
network code solutions: fractional routing network code solutions, fractional linear network code 
solutions, and fractional nonlinear network code solutions. When it is clear from the context, we 
refer to them by appropriate abridged versions from time to time. 

If (F, k, n, T e , Fd) is a fractional network code solution for M = (y, e, fi, c, A, S, R), source node 
x G v sends a vector of k% symbols from F for each message rrii G S(x); each receiver node x G v 
demands the original vector of ki symbols corresponding to message nii for each rrii G R{x); and 
each edge e carries a vector of c(e)n symbols. We refer to coordinates of the symbol vector of 
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length ki corresponding to message mi as message nii 's coordinates. Note that each coordinate of a 
message is independent from others. We use coordinates and symbols for messages interchangeably; 
the i-th symbol of message m refers to the i-th coordinate of the message. We refer to coordinates 
of the symbol vector of length c(e)n on edge e as edge e 's coordinates. Note that a coordinate of 
edge e can be active, meaning it actively carries a symbol in the fractional network code solution, or 
inactive, meaning it is not used in the solution. For instance, if an edge with 5 available coordinates 
has to send 2 independent symbols through the edge, then it suffices to use only 2 coordinates; in 
this case, the coordinates that carry symbols are active and the other 3 coordinates are inactive. 
We define a notion of minimal network code solutions as follows: 

Definition 30 (Minimal Fractional Network Code Solution). A fractional network code solution 
{F,k,n,F e ,Fd) for N is minimal if the set A of all active coordinates of edges in the solution is 
minimal, i.e., there exists no (k,n) fractional network code solution for M with the set of active 
coordinates that is a strict subset of A. 



3.2 Network Capacity Regions 
3.2.1 Definitions 

Definition 31 (Achievable Coding Rate Vector). Let N = {v, e, /i, c, A, S, R) be a capacitated 
network. A vector of positive numbers . . . , £ Q+' is an achievable coding rate vector if 
there exists a fractional network code solution (F, k, n, T e , Td) for M where k = (k\, . . . , fc| M |). 

Definition 32 (Network Capacity Region). Let N = (v,t, [i,c,A, S, R) be a capacitated network 
and mi, . . . , be the messages. The network capacity region C of M is the closure of all achiev- 
able coding rate vectors in M)^, 



C = closure < — : — 
n n 



( — ,..., is an achievable coding rate vectors 1 . 

\n n J j 



By definition, a network capacity region is a set of points in the Euclidean space ' . 

There are different classes of achievable coding rate vectors and, hence, corresponding classes 
of network capacity regions: the network routing capacity region, C r , which is the closure of all 
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achievable routing rate vectors; the network linear coding capacity region, C\, which is the closure 
of all achievable linear coding rate vectors; and the network nonlinear coding capacity region, C, 
which is the closure of all achievable nonlinear coding rate vectors (or the network capacity region, 
equivalently) . 

3.2.2 Properties 

We show that the network capacity regions are closed, bounded and convex sets and satisfy an 
additional property. 

Theorem 33. Let M = {y, e, fx, c, A, S, R) be a capacitated network and mi, . . . , m^j be the mes- 
sages. The corresponding network capacity region C is a closed, bounded and convex set in ' . 

Proof. (Closedness) By definition, C is a closure of a set and, hence, closed. 

(Boundedness) We show that — is bounded for all i in the achievable coding rate vector 




By symmetry, it suffices to show for -f . Let n be the edge dimension, u\ be 



the set of nodes in v that generate message m\ and 7 be the sum of capacities of out-edges of nodes 
in v\. Then, k\ < 777, as we cannot send more than 771 independent coordinates of message mi and 
expect receivers to recover all the information. Hence, ^ < 7. It follows that C is bounded. 

(Convexity) Let xq,xi G C and A G [0, 1]. We show that x = (1 — A)xo + Axi G C. We write 
x = xq + A(xi — xq). There exists sequences of achievable coding rate vectors converging to xo 
and x\, say {yo,j} and {yij} respectively. Let {Xj} be a sequence of rationals converging to A. 
Then, yj = j/oj + ^j(yi,j — 2/0, j) is an achievable coding rate vector for j = 1, 2, . . .. Let Xj = |, 
Voj = and yi ,,= (£,..., Then, 

_ ( (q - p)hn' + pk[n (v ~ P) k \»\ n ' + P k ' M n \ 
^ I qnn' ' ' qnn' J 

There exists a fractional network code solution (F,k,qnn' ,F e ,Fd) where k = ((q — p)k\n' + 
pk[n, . . . , (q — p)k\^n' + pk'^n); if NC\ and NC2 are two fractional network code solutions with 
rate vectors yoj and y±j, then for first {q — p)nn' coordinates we employ (q — p)n' copies of NC\ 
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and for the remaining pnn' coordinates we employ pn copies of NC2- Then, 

\ x -Vj\ = \ x o + A(xi - x ) - y 0jj - Xj(yij - yoj)\ 
= |(1 - A)x - (1 - Aj)y ,j + Axi - Xjyijl 
< |(1 - X)x - (1 - Xj)y ,j\ + \Xx! - Xjy hj \. 

Since Xj — > X and yo,j x o, (1 — Xj)yoj converges to (1 — X)xq and |(1 — A)xo — (1 — Xj)yoj \ can 
be made arbitrarily small for sufficiently large j. Similarly, |Axi — Xjyij\ can be made arbitrarily 
small for sufficiently large j. It follows that the sequence of achievable coding rate vectors {yj} 
converges to x and that x G C. □ 

Corollary 34. Let M = (v, e, fi, c, A, S, R) be a capacitated network and mi, . . . , m| M | be the mes- 
sages. The corresponding network routing capacity region, C r , and network linear coding capacity 
region, C\, are closed, bounded and convex regions in 

We note that the network capacity regions are of very special kind by definition: 

Proposition 35. The network capacity region C is a region such that iff = (n, . . . ,?*|^|) S C, then 
the parallelepiped [0, r±] x . . . x [0,r| M |] is contained in C. The same holds for the network routing 
capacity region C r and the network linear coding capacity region Ci . 

We use bdC to denote the boundary of the network capacity regions. Similarly, we use bdC r 
and bdCi to denote the boundaries of corresponding regions. 

3.3 Network Routing Capacity Regions 

In this section, we prove that the network routing capacity region is a bounded rational polytope, 
and provide exact algorithms and approximation heuristics for computing it. Since multi-source 
multi-sink networks can be reduced to multiple multicast networks, it suffices to show the results 
with respect to the multiple multicast networks; for each message m, we add a "super source node" 
that generates the message m and connects to source nodes that generate m via edges of infinite, 
or sufficiently large, capacities. We assume that the given networks in this section are multiple 
multicast networks for simpler presentation of results. 
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3.3.1 Properties 

Theorem 36. The network routing capacity region C r is a bounded rational polytope in and is 
computable. 

Proof. (Polytope) It suffices to consider minimal fractional routing solutions since any fractional 
routing solution can be reduced to a minimal one by successively making unnecessary active edge 
coordinates inactive. For each coordinate of a message m, it suffices to route it along a Steiner 
tree rooted at the source node of m and spanning all the receiver nodes demanding m. Hence, any 
minimal fractional routing solution consists of routing messages along Steiner trees. Let % be the 
set of all Steiner trees rooted at the source node of message mi and spanning all receiver nodes 
that demand mj, and T be the union, T = T\ U . . . U T^y Note that T is a finite set. Then, any 
minimal fractional routing solution (F, k, n, T e , J^d) satisfies the following constraints: 

Y,TeT T ( e ) • X ( T ) ^ c ( e ) • n > Ve G 6 
£ T67i x(T) = h, VI < z < | M | 

x > 0, 

where x(T) is the number of times Steiner tree T is used in the solution and T(e) is an indicator 
that is 1 if the Steiner tree T uses the edge e, or otherwise. Dividing all variables x(T) by n, we 
obtain 

£ Ter T(e)-x(T) < c(e), Ve G e 

Ere% x ( T ) = I' Vl<i<H 

x > 0. 

It follows that all minimal fractional routing solutions, after scaling by n, satisfy 

Erer T ( e ) • < T ) < c ( e )> VeGe 
x > 0. 

As the coefficients are in Q, the above set of inequalities defines a bounded rational polytope V, 

ITI 

with rational extreme points, in M + . The polytope is bounded, because edge capacities are finite 
and no Steiner tree can be used for routing for infinitely many times. Each minimal fractional 
routing solution reduces to a rational point inside the polytope V, and each rational point x inside 
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V has a minimal fractional routing solution (F, k,n,J r e ,J-' c i) that reduces to it, such that 




To see the latter statement, we take a rational point in V, put rationals under a common denomina- 
tor, and choose appropriate k and n. As rational points are dense, the closure of the rational points 
corresponding to minimal fractional routing solutions is exactly V . It follows that the network 
routing capacity region C r is the image of V under the affine map 



As the affine map preserves rationality, it follows that the network routing capacity region is a 
bounded rational polytope in M^' . 

(Computability) We show that we can compute the vertex description (the extreme points) of 
the polytope C r . We compute the vertices v\ , . . . , of polytope V by any vertex enumeration 
algorithm where the starting point can be any point that corresponds to using a single Steiner tree 
for the maximum number of times allowed by the network. We compute the images of the vertices 
of V under the affine map ip r . The network routing capacity region is given by the vertices of the 



The network routing capacity defined by Cannons et al.[2] corresponds to a point on the bound- 
ary of polytope C r ; it is exactly the intersection point between the (outer) boundary bdC r and the 
ray x = (1, . . . , l)t, t > 0. As the ray has a rational slope, the intersection point is rational and, 
hence, Corollary IV. 6 in [2j follows straightforwardly. We use V r to denote the "parent" polytope, 



3.3.2 Algorithms 

We provide exact algorithms and approximation heuristics for computing the network routing 
capacity region C r . This subsection goes together with next two subsections, so we advise the 
reader to refer to these subsections as necessary. We assume that a capacitated network N is 
given if not stated explicitly. We already provided an exact algorithm for computing C r in the 





□ 



in Theorem |36[ of the network routing capacity region C r . 
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proof of Theorem 36 which we refer to as Algorithm VertexEnum_Route. The algorithm takes the 

hyperplane description of V r and outputs the hyperplane description of C r . Since the polytope 

I7~l 

V r is defined in a high dimensional space W + where \T\ could be exponential in the description 
size of networks, Algorithm VertexEnum_Route may not be efficient in practice as there could be 
exponentially many vertices. 

Algorithm 1 Algorithm VertexEnum_Route(A/') 
1: Form the hyperplane description of polytope V r - 

2: Compute the vertices of V T with a vertex enumeration algorithm and obtain vi, . . . , Vh- 
3: Compute the image of the vertices, ip r (vx), . . . , ip r (vh)- 
4: return convex hull of tjj r (vi), . . . , ip r (yh)- 



To design more efficient algorithms and approximation heuristics, we recast the computation of 
the network routing capacity region as the polytope reconstruction problem with a ray oracle and 
use related results in literature [5j [15] . More specifically, we formulate the reconstruction problem 
as follows: 

Definition 37 (Polytope Reconstruction Problem). Let Q be a polytope in M. d that contains the 
origin in its interior and On ay be a ray oracle that given a ray of the form x = ft,t > 0, computes 
the intersection point between the ray and the boundary of Q. Compute a polytope description of 
Q using a finite number of calls to the oracle 0^ ay . 

We reduce the computation of the network routing capacity region C r to a polytope reconstruc- 
tion problem by 1) reflecting C r around the origin to get a symmetric polytope Q in M.^ that 
contains the origin in its interior and 2) solving the linear programs similar to the one in Cannons 
et al. [2] to implement the ray oracle On ay . To reflect C r , we map all calls to the ray oracle to 
equivalent calls with rays defined in R^ 1 . We use the algorithm outlined in Section 5 of Gritzmann 
et al. [15J to compute all the facets of the resulting polytope Q and recover the facets of the network 
routing capacity region C r . We refer to the overall algorithm as Algorithm FacetEnum_Route. The 
main idea of the algorithm is to first find a polytope Q! that contains Q and whose facet-defining 
hyperplanes are a subset of those for Q (Theorem 5.3 in [E]), and then successively add more 
facet-defining hyperplanes of Q to Q' by using OR ay . In other words, we start with a polytope that 
contains Q and successively shrink it until it becomes Q. By Theorem 5.5 in Gritzmann et al. |15j . 
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we need at most 

HQ) + (H - 1)/h-i(Q) + (5|mI " 4)/ H _i(Q) 

calls to the ray oracle 0R a?/ to compute the facets, where /j(Q) denotes the number of i-dimensional 
faces of Q (the 0-th dimensional faces being the points). Because of the symmetries around the 
origin, we need at most 

fo(C r ) + (| M | - l)ft hl (Cr) + (5| M | - 4)f M -l(Cr) 

calls to the ray oracle where fi(C r ) denotes the number of i-dimensional faces of C r that do not 
contain the origin. 

Algorithm 2 Algorithm FacetEnum _Route(AA, Ou ay ) 

\T\ 

1: Form the hyperplane description of V r in W + '. 

2: Internally, reflect C r around the origin to get the polytope Q in M^L 
3: Using Ojiay, compute a polytope Q' containing Q. 
4: while Q! has undetermined facets do 
5: Compute the vertices of Q! '. 

6: Using OR a y, compute the intersection points on rays defined by the vertices of Q' . 
7: Add newly found facet-defining hyperplanes of Q to Q' . 
8: end while 
9: Retrieve facets of C r . 
10: return the facet description of C r . 

Depending on the implementation of OR ay: we get exact algorithms and approximation heuris- 
tics for computing C r . If we use an exact algorithm for the ray oracle On a y, we get an exact 
hyperplane description of the network routing capacity region via Algorithm FacetEnum_Route. If 
instead we use an approximation algorithm for the oracle that computes some point r such that 
the actual intersection point lies between r and Ar, then we obtain approximation heuristics that 
compute a set of points r such that the boundary bdC r lies between points r and Ar. We note that 
an approximation algorithm for OR ay does not necessary work with Algorithm FacetEnum_Route 
to give an approximation algorithm for C r , where an A- approximation of C r would be a polytope 
V such that V C C r C AV . While an approximation algorithm for the oracle OR ay does not 
necessarily lead to a polytope description of C r , it might be faster and more efficient than exact 
algorithms and, hence, more applicable to compute a quick "sketch" of the capacity region C r . One 
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approximation heuristic for computing the region C r would be to take a sufficiently large number 
of rays evenly spread apart throughout the space and use an approximate oracle On ay to find 
the approximate intersection points. As there are many simple variations of this approach, we do 
not go into the details of the heuristics themselves in this work. 

3.3.3 Implementations of Exact and Approximate Oracle On ay 

In this subsection, we provide both exact and approximation algorithms for the ray oracle On ay 
used in Algorithm FacetEnum _Route. The implementations of the oracle reduce to solving a linear 
program. We use any linear programming algorithms, such as the ellipsoid algorithm and simplex 
algorithm, to solve the linear program exactly and obtain an exact oracle On ay . For the approximate 
ray oracles, we design a combinatorial approximation algorithm using techniques by Garg and 
Konemann [T3] . Alternatively, we could use the ellipsoid algorithm with an approximate separation 
oracle, but as the ellipsoid algorithm is slow in practice, this might not be a viable approach. As 
the network routing capacity region C r is a rational polytope, it suffices to consider rays with a 
rational slope in Q^'. 

Algorithms 

Given the hyperplane description of the polytope V r , 



and a ray with a rational slope, x = qt, t > 0, we want to compute the rational intersection point 
of the ray and the boundary of C r . It is straightforward to see that the intersection point is exactly 
^maxQ where A max is the optimal value to the linear program 



J2reT T ( e ) " x ( r ) ^ c ( e )' Ve£f 



x 



> 



max A 
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Since the coefficients of the linear program are rational, the optimal value A max and the corre- 
sponding solution x are also rational. While written in a different form, the linear program is 
equivalent to the one in Cannons et al. [2] when the ray is x = (1, ... , t > 0. We use any linear 
programming algorithm, such as the ellipsoid algorithm and simplex algorithm, to solve the above 
linear program exactly and obtain Algorithm Or acleRay Exact _Route. We note that the running 
time of Algorithm OracleRayExact_Route could be poor as the linear program has exponentially 
many variables (one for each Steiner tree) and the associated separation problem is NP-hard (the 
minimum cost directed Steiner tree problem). 

Algorithm 3 Algorithm OracleRayExact_Route(A/", q) 



Form the linear program (3.3.1) corresponding to N and q. 

Solve the linear program with a linear programming algorithm and obtain A r 

return A max g. 



We now provide a combinatorial approximation algorithm for solving the linear program (3.3.1 ) 



and, hence, for oracle 0^ ay . It computes a point r such that \ m axq is on the line segment between 



f and (1 + u))Ar for some numbers ui > and A > 1. The main idea is to view solving (3.3.1) as 
concurrently packing Steiner trees according to the ratio defined by q, and use the results for the 
multicommodity flow and related problems by Garg and Konemann [13] . Instead of a shortest path 
algorithm, we use a minimum cost directed Steiner tree algorithm for our purpose. We assume 
we have an oracle Oosteiner that solves the minimum cost directed Steiner tree problem, which is 
well-known to be NP-hard, within an approximation guarantee A: 

Definition 38 (Minimum Cost Directed Steiner Tree Problem). Given an acyclic directed multi- 
graph Q = (y, e), a length function I : e — >■ M + , a source node s and receiver nodes n±, . . . ,nk, find 
a minimum cost subset of edges e' such that there is a directed path from s to each ni in e' . The 
cost of a subset e' is X^eee' K e )- 

First, we consider networks with exactly one message to route. In this case, the linear pro- 



gram (3.3.1) reduces to the following simpler linear program: 



max J2reT x (T) 

s.t. E Te r T ( e ) - X ( T ) ^ c ( e )> VeGe ( 3 - 3 - 2 ) 
x > 0, 
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where T is the set of all Steiner trees rooted at the source node of the message and spanning all 
the receiver nodes demanding the message. Note that the original problem now reduces to the 
fractional directed Steiner tree packing problem (compare to the fractional Steiner tree packing 



problem in [IT]). To solve (3.3.2), we use Algorithm DSteinerTreePacking which is a straightfor- 
ward modification of the maximum multicommodity flow algorithm given in Section 2 of Garg and 
Konemann [13]. In Algorithm DSteinerTreePacking, mincost(Z) denotes the cost of the (approxi- 
mate) minimum cost directed Steiner tree found by Oosteiner- 

Algorithm 4 Algorithm DSteinerTreePacking(A/", u, Od steiner, A) 

1: rj=^oj;8=(l+ri)((l + r l )L)- 1 ^ 
2: / = 0; 1(e) = <5,Ve € e 
3: while mincost(Z) < A do 

4: Use Oosteiner to compute an approximate minimum cost Steiner tree T, under the length /. 
5: d = min e . f (e)=1 c(e) 
6: f = f + d 

7: Update /(e) = /(e) (l + r/^y) , Ve s.t. T(e) = 1. 

8: end while 

9: return // log 1+J? ^ 

Essentially the same analysis in [13] works for Algorithm DSteinerTreePacking except that our 
approximation guarantee is worse by a factor of A since we use an approximate oracle O d steiner- 
We omit the analysis and summarize the performance of the algorithm as follows. For computations 



involving r] and u, we refer to Appendix A. 1.1 



Theorem 39. ForO <u <1, Algorithm DSteinerTreePacking computes a (1+uj) A- approximate 
solution to the linear program (3.3.2) in time O(o; _2 |e| logL • Tjjsteiner), where L is the maximum 



number of edges used in any Steiner tree in T and Tjjsteiner is the time required by oracle Oosteiner 
to solve the minimum cost directed Steiner tree problem within an approximation guarantee of 
A>1. Note that L < |e|. 

We now give Algorithm OracleRayApprox_Route, for oracle OR ay , which computes a (1 + co)A- 



approximate solution to the linear program (3.3.1). It uses Algorithm DSteinerTreePacking as a 
subroutine. 



42 



Algorithm 5 Algorithm OracleRayApprox_Route(A/', u, Ousteiner, A, q) 
1: Using Algorithm DSteinerTreePacking, compute the approximate value Z{ to the linear pro- 



gram (3.3.2) for each message m« separately. 



z = minj — ; scaling -factor 



q = scaling _/ actor ■ q 
n = ^;5 = (\e\/(l-nA)r^A 



N = 2 



V A lo Sl+»? 1- V A 
while true do 

i = °; l (e) = 4),VeGe 
for phase i = 1, . . ., N do 
for iteration j = 1, . . . , \f/,\ do 

7 = ?j 

while 7 > do 

Using Onsteiner-, compute the minimum cost directed Steiner tree T G Tj, under I. 
d = min{7, c(e) : T(e) = 1} 
7 = 7 — (f 

^ _ 7^ ^ i _l ^, 

'c(e) 

if EeKe)c(e) > 1 then 

Goto Line[25j 
end if 
end while 
end for 
t = t + 1 
end for 

scaling -factor = 2scaling_ factor; q = 2q 
end while 

return scaling _f 'actor ■ t/log 1+rj | 



Update l{e) = 1(e) ( 1 + n-f-r ) ,Ve such that T(e) = 1. 
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Analysis 

While Algorithm OracleRayApprox_Route is closely related to the maximum concurrent flow algo- 
rithm given in Section 5 of Garg and Konemann |13j , there are significant differences and we give an 



analysis of the algorithm below. The linear program (3.3.1 ) can be thought of as the "concurrent" 
fractional directed Steiner tree packing problem. We have directed Steiner trees partitioned into 
different groups according to the network messages. Then, computing the optimal value of the 



linear program (3.3.1) is equivalent to fractionally routing along the Steiner trees so that the ratios 
among the overall usages of groups of Steiner trees correspond to the ratios among the coordinates 
of the q vector. 



The dual linear program corresponding to (3.3.1) is 



min Yy e l( e ) c ( e ) 

e.)-z(i) < 0. VI < i < lul.VT G 77 

(3.3.3) 



s -t- Ee:T{e)=l l (e) ~ z(i) < 0, VI < % < \fx\ , VT G Ti 



l,z > 0. 

From the dual linear program, we note that the associated separation problem is exactly the 
minimum cost directed Steiner tree problem. Let mincostj(Z) denote the cost of the minimum 
cost directed Steiner tree in T% under the length function /. Define D(l) = ^2 e&e l(e)c(e) and 
a(l) = Xyi=i 9imincostj(Z). Then, solving the dual linear program is equivalent to finding an 

D(l) 
a(l) 

We first consider a modified version of Algorithm OracleRayApprox_Route, with the infinite 
while-loop in line 6 removed, with variable scaling .factor and line 23 removed, and with the finite 
for-loop in line 8 replaced with an infinite while-loop. The following holds: 

Theorem 40. Assume f3 > 1. For < u> < 1, Algorithm OracleRayApprox_Route, with the 



assignment of lengths to edges, I : e — > M + , so as to minimize Let /? be the optimal value of 

the dual linear program, i.e., /3 = min; 



modifications, returns a (1 + u) A- approximate solution to the linear program (3.3.1) in at most 
number of phases. 



V A 10 Bl+r; i-r,A 



Proof. Algorithm OracleRayApprox_Route in lines 8-22 proceeds in phases which in turn consist of 
|/i| iterations. Each iteration consists of variably many number of steps of the while- loop in line 11, 
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depending on how quickly 7 is decreasing. In each j-th iteration in line 9, qj units of message rrij 
are routed using Steiner trees in Tj. By lines 16-18, the algorithm terminates as soon as D(l) > 1. 
Let t be the last phase in which the algorithm terminates. Let Z*" 1 denote the length function I 
and the variable 7 at the start of s-th step of the while-loop (line 11) of j-th iteration in phase 
i. Let T?- denote the Steiner tree T selected in the s-th step of j-th iteration in phase i. Let Z^o 
be the length function at the start of phase i and Zj 1^1 be the length function at the end of phase 
i (after the termination of the |/x|-th iteration). Note that k+ifi = k,\fi\- Let hj-i be the length 
function at the start of iteration j of phase i. For simplicity, we denote Dfy^) and a(/j | M |) by 
D(i) and a(i), respectively. Then, 

e 

<^7 1 ) + ^7 1 

Then, 

D{l iJ+ i) < Dik^+rfAj^dtJ 1 ~ 7ij)mincost j+ i(Z iJ+ i) 

s 

= D(lij) + r/Aq j+1 mincost J+ i(Zij + i) 
< D(kj) + rjAq j+1 mincostj + i(Z ij | j[1 |), 

where we used the fact that mincostj(Zf ■) < mincost^Zjj+i) as the length function and mincost 
are nondecreasing in any fixed argument throughout the algorithm. After summing up the above 



e:T? j (e)=l 

- -yf -)A mineral;, (ijj 1 ). 
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inequalities over the iterations of phase i, we get 



D{l itM ) < + r]Aq M mincost^^j^i) 

< D ih,\n\-^ + vMQM-i mincost |M|-i(^|/.|) + ?M mincost |/i|( z i,|M|)) 



ImI 

< D(l ifi ) +7]A^2q j m.mcost j (l ii \ fl \), 

3=1 

and it follows that 

D(i) < D(i - l)+r]Aa(i). 
Since ^® > /?, it follows that -D(i) < x-r}A/p • Since -D(O) = |e|<5, we have for j > 1, 

l-riA/p\ P~V A J 

\e\5 7;A(i-i) 

< — e z 3 - 1 ?- 4 

- 1-riA/p 

\e\8 yA(i-i) 

< — — efP-" 1 ) , 

_ 1 - T]A 

where the last inequality uses the assumption that (3 > 1. The algorithm terminates in phase i for 
which D(t) > 1. Hence, 

1 < D(t) < 1 1 e P(.l-vA>, 
~ W ~ 1 - 7] A 

And it follows that 



P / 7/A_ 



< 1 — r- (3-3.4) 



In the first i — 1 phases, we have routed (t — l)qj units of message rrij, for j = 1, . . . , This 
routing solution may violate the edge capacity constraints, but, by the following claim, we obtain 
a feasible solution with A > , — , s . See Claim 5.1 in [13] for the proof of the claim as the same 
proof works here. 



Claim 41. There exists a feasible solution with A > 



t-i 
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Thus, the ratio of the values of the optimal dual and feasible primal solutions, £, is strictly less 
than jfj log 1+7? |. By ( |3.3.4| ), we get 



< 77^ log 1+7) | _ r/A In 1/5 



l-^l n i=j^ (l-»7A)ln(l + v) In ±=j^ 



A.1.2 



For 5 = (|e|/(l — r]A)) 1 l r i A , the ratio \1 v a equals (1 — nA) 1 and, hence, 

c < ^ < < n - T7/ir 3 4 

^ " (l-7/A) 2 ln(l + 7/) " (I-t?^) 2 ^ -?? 2 /2) ~ 

For r/ = g^w, (1 — r/A)~ 3 A is at most our desired approximation ratio (1 + lu)A (see Appendix 
for details). By weak-duality, we have 

P , 1 
l<C<^ I log a+ ^ 

and, therefore, the number of phases in line 8 is strictly less than log 1+rj 1/5, which implies that 



Algorithm OracleRayApprox_Route terminates in at most ^ log 1+J? fzwg 



number of phases. □ 



Note that by Theorem 40, the running time of Algorithm OracleRayApprox_Route, with the 
modifications, depends on f3. Note that j3 can be reduced/increased by scaling the q vector /capac- 
ities appropriately. We now remove the assumption f3 > 1 and analyze the running time of Algo- 
rithm DracleRayApprox_Route clS cl whole. 

Theorem 42. For < u) < 1, Algorithm DracleRayApprox_Route computes a (1 + w) A- approx- 



imate solution to the linear program (3.3.1) in time 0(u~ (|/x| log A\fi\ + |e|)^41og|e| • Tosteiner), 
where Tosteiner is the time required to solve the minimum cost directed Steiner tree problem with 
oracle Od steiner within an approximation guarantee A. 

Proof. To remove the dependency of the running time on /3, we update the q vector and variable 
scaling -factor appropriately. Let z* be the exact maximum fractional Steiner tree packing value 
for message mi in line 1 and let z* = minj — . Then, z* is an upper bound on the maximum rate 

Hi 

at which the messages can be routed in a minimal fractional routing solution. Since z% < z* < Azi, 
r^T < ft < Az. We scale the q vector as in line 3 so that 1 < ft < A\n\. The assumption j3 > 1 is 
satisfied, but j3 could now be as large as A\fjb\. We employ the doubling trick as explained in Section 
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5.2 in [13] and line 23 accomplishes this, together with N = 2 \ log 1+77 \1\a ™ ^ ne for-loop in 
line 8. Since we halve the "current" value of j3 after every TV phases, the total number of phases 
is at most NlogA\fi\. Since there are \//,\ iterations per phase, there are at most N\//,\ log A\fi\ 
iterations in total. Each iteration consists of variably many number of steps and, in all steps but 
the last, we increase the length of an edge by a factor of 1 + r/. By similar reasons as in the proof 
of Claim 5.1 in |13j . the length of each edge can increase by a factor of 1 + r/ at most log 1+ „ | times 
throughout the algorithm. Hence, the total number of steps of the while- loop in line 11 exceeds 
the total number of iterations by at most |e| log 1+J? j. The total number of steps, hence calls to the 
oracle DS teiner, is at most N\n\ log + |e| log 1+r? |. 
Note that r? = 0(|). Then, 

N\n\logA\p\ + |e|log 1+r/ i = N\n\]ogA\n\ + log 1+7? y~^A 

_ / lyul log AI/J.I lei lei lei \ 

= ° { v a log ^ i^nl + logl+ " T^nA J 

_ / \n\\og A\fi\ + |e| ln|e| + ln(l/(l-7?^)) \ 
V rjA ln(l + J 

< Q / Hjog^ + | € | _ ln|e|+ln(l/(l-7?^)) \ 

_ \ 7]A 7] — 7] 2 /2 J 

_ log A\fi\ + |e| In |e| + In 1/(1 - ui) \ 

~ V w ' w/A-(a;M) 2 /2 J 

= 0(o;- 2 (|HlogA| / x| + |e|)^log|e|) 

□ 

3.3.4 Implementations of Oracle Oosteiner 

The oracle Ousteiner solves the minimum cost directed Steiner tree problem (Definition 38). A brute 



force approach is to loop through all possible subsets of e and select one with the minimum cost that 
satisfies the directed Steiner tree conditions. Since checking whether a subset of edges supports a 
directed path from the source to each receiver node can be done in 0(\v \ + |e|) time, the brute force 
approach has the total running time of 0((M + |e|)2l e l) and finds an exact optimal solution. We can 
also compute an 0(A;)-approximate solution by computing a shortest path from the source to each 
receiver and combining the paths to form a tree, where k is the number of receivers. A shortest 



48 



path can be computed efficiently and there are many shortest path algorithms. For instance, 
the Dijkstra algorithm suffices for our purpose and gives an 0(fc)-approximate solution in time 
0(M(IH + l e l) 1°S M) with a binary minheap. There exists an efficient approximation algorithm for 
the minimum cost directed Steiner tree problem with a significantly better approximation guarantee 
by Charikar et al. |4j. Charikar et al. designed a family of algorithms that achieves an approximation 
ratio of i(i — in time 0(n*fe 2 *) for any integer i > 1, where n is the number of nodes and k is 

the number of receivers. For our problem, k < \v\ and n < \v\ and we get algorithms that achieve 
an approximation ratio of i(i — l)!^ 1 / 1 in time 0(|^| 3j ) for any integer i > 1. For i = \og\u\, we 
obtain an approximation ratio of 0(log \v\) in time of 0(M 31 °sM). We summarize with Table 
Note the tradeoff between the approximation ratio A and the running time. 

Table 3.1: Implementations of oracle Op steiner 



3.1 



Algorithm for DS teiner 


Approximation Ratio A 


Time 


BruteForce 


1 


0((M + |e|)2N) 


ShortestPathApproximation 


o(H) 


o((|H 2 + IHkl)iog|^|) 


Charikar et al. [4j 




0{\u\ 3i ) 




0(log 2 \u\) 


0(|jy| 31 °glH) 



3.4 Network Linear Coding Capacity Regions 

In this section, we show a computable inner bound on the network linear coding capacity region Ci 
with respect to a given finite field. In particular, we show how to compute a polytope C>, which we 
call the semi-network linear coding capacity region, that is contained in C\. If C[ is strictly bigger 
than C r , then linear coding helps improve the information throughput through the network. It is 
unknown at this time how good of an approximation the polytope C[ is to the actual network linear 
coding capacity region C/. Unlike in the computation of the network routing capacity region, the 
finite field is important in the computation of C[. We assume that a network J\f, not necessarily a 



multiple multicast network as in Section 3.3, and a finite field F are given in what follows, if not 
stated explicitly. 
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3.4.1 Definitions 

Definition 43 (Weight Vectors and Partial Scalar-Linear Network Code Solutions). Let N be a 

network with unit edge capacities and m%, . . . ,Tn\^\ be the messages. The weight vectors associated 
with N ', or simply weight vectors, are vectors w in {0, such that there exists a scalar-linear 
network code solution for J\f when only messages mi with Wi = 1 are considered, i.e., for N with 
the new message set // = {m; : W{ = 1}. We refer to the scalar-linear network code solutions 
corresponding to these weight vectors as partial scalar-linear network code solutions, or partial 
scalar-linear solutions for short. 

Note that by definition, Steiner trees are also partial scalar-linear network code solutions. 

Definition 44 (Simple Fractional Linear Network Code Solution). Let N = (z/, e, [i, c, A, S, R) be 

a capacitated network and mi, . . . , mui be the messages. A fractional network code (F, k, n, J- e , J~d) 
is a simple fractional linear network code solution, or simple fractional linear solution for short, 
if the fractional network code is linear over the finite field F and can be decomposed into a set of 
partial scalar-linear solutions of N (when considered with unit edge capacities). 

Definition 45 (Semi-Network Linear Coding Capacity Region). Let N = (v, e, fi,c, A, S, R) be 
a capacitated network and mi, . . . , m^ be the messages. The semi-network linear coding capacity 
region C[ of N is the closure of all coding rate vectors achievable by simple fractional linear network 
code solutions. Note C[ C M^' . 

Clearly, the network linear coding capacity region C\ contains the semi-network linear coding 
capacity region C[ as the set of fractional linear code solutions is a superset of the set of simple 
fractional linear code solutions. 



3.4.2 Properties 

Theorem 46. Assume a finite field F is given. The semi-network linear coding capacity region C[, 
with respect to F , is a bounded rational polytope in Kir and is computable. 



Proof. (Polytope) The proof is similar to the proof of Theorem 36. Let J\f = (v,e, fi,c,A, S, R) 
be a network. Let wi, . . . , Wk> be all the weight vectors associated with N . Let Wi be the set 
of all partial scalar-linear network code solutions that satisfy the demands corresponding to the 
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weight vector Wi and W be the union, W = Wi U . . . U Wy- Note that WVs are finite nonempty 
disjoint sets. Then, any simple fractional linear code solution (F, k, n, F e , J 7 ^) can be decomposed 
into partial scalar-linear solutions in W and satisfies the following constraints: 



EweW W ( e )' x ( W ) ^ c ( e )- re ' VeGe 
EiliEwewJ^W^O = k h Vl<i<|/x| 
x > 0, 

where is the number of times the partial scalar-linear solution W is used in the fractional 

linear solution and W{e) is an indicator that is 1 if the solution W uses edge e, or otherwise. 
After dividing all the variables :c(VF) by n, it follows that all simple fractional linear code solutions 
satisfy 

EwewW(e)-x(W) < c(e), Ve e e 
x > 0. 

Using the affine map 

/ k' k' 



=i weWi i=i weWi 



we follow the similar lines of reasoning as in Theorem 36 to show that the semi-network linear 
coding capacity region C[ is a bounded rational polytope in M^' . 



(Computability) We apply the same proof for C r to C[ using the inequalities in (3.4.1). □ 



We use V', to denote the "parent" polytope of C[ defined by (3.4.1) 



3.4.3 Algorithms 

We obtain algorithms and heuristics for computing the semi-network linear coding capacity region 



C[ from algorithms and heuristics in Section 3.3.3 with little modifications; we use the polytope 
description of V\ instead of V T and use the ray oracle 0^ ay for C[. We denote the resulting algorithms 
by Algorithms VertexEnumJLCode and FacetEnumJLCode. We omit the details of the algorithms. 
This subsection goes together with next two subsections. 
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3.4.4 Implementations of Exact and Approximate Oracle On ay 
Algorithms 

We provide the implementations of oracle 0^ ay for the semi-network linear coding capacity region 
C[. As the region C[ is a rational polytope, it suffices to consider rays with a rational slope. Given 
the hyperplane description of the polytope V[, 

Zw&vW(e) ■ x(W) < c(e), Ve G e 
x > 0, 

and a ray with a rational slope of the form x = qt, t > 0, we would like to compute the rational 
intersection point of the ray and the boundary of polytope C[. The intersection point is X max q 
where \ ma x is the optimal value to the linear program: 

max A 

s-t. EwtwW(e)-x(W) < c(e), Ve G e 

fe' (o-4.2J 

s,A > 0, 

where w\, . . . ,Wk' are the weight vectors associated with network A/". Since the coefficients of 
the linear program are rational, the optimal value \ ma x and the corresponding solution x are 



rational. We can use any linear programming algorithm to solve (3.4.2) exactly, as in Algorithm 
OracleRay Exact .Route, and obtain Algorithm OracleRayExact.LCode. We omit the pseudocode. 
Using techniques by Garg and Konemann [13] . we provide a combinatorial approximation algo- 



rithm, Algorithm DracleRayApprox_LCode, for solving the linear program (3.4.2) approximately. 



While the linear program (3.4.2) looks similar to the linear program (3.3.1), it is much harder to 
solve. The algorithm computes a point f such that X m axQ is between f and (1 + co)Br, for some 
numbers u > and B > 1. We assume we have oracles OsLinear and OFCover for the following two 



subproblems related to (3.4.2): 



Definition 47 (Minimum Cost Scalar-Linear Network Code Problem). Given a network J\f = 
{y, e, /i, c, A, S, R) with unit edge capacities, a finite field F and a length function I : e — > M+, 
compute the minimum cost scalar-linear network code solution for M with respect to F , if it exists. 
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The cost of a solution is the sum of lengths of the edges used in the solution. If there is no scalar- 
linear solution, then report "unsolvable. " 

Without the minimum cost condition, the above problem reduces to the decidability problem of 
determining whether or not a network has a scalar-linear solution, which is NP-hard by Theorem 
3.2 in Lehman and Lehman [21J. Hence, the above problem is at least as hard as any NP-hard 
problem. We assume that OsLinear solves the minimum cost scalar-linear network code problem 
exactly. 

Definition 48 (Fractional Covering with Box Constraints Problem). Given an nx m nonnegative 
integer matrix A, a nonnegative vector b, a positive vector c and a nonnegative integer vector u, 
compute 

min Y!j=i c 0Xi) 

s.t. EjMhJMj) > 6(»), Vl<f<n 
x{j) < u(j), VI < j < m 

x > 0. 

In Algorithm OracleRayApproxJLCode, OFCover solves the fractional covering problem of the 
following form with an approximation guarantee of B: 

min HiUy(i)Ui(i) 

s-t. Ei=iKL/2/(0 ^ <ij, vi<j<H ^ 43 ^ 
y > o, 

where wi, ... ,Wk> are the weight vectors associated with the network and Ui(l) is the cost of the 
minimum cost partial scalar-linear solution in with respect to the length function I. 
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Algorithm 6 Algorithm OracleRayApprox_LCode(AA, U), OsLinear, @FCover> B, 



10 
11 

12: 
13: 
14: 

15: 
16: 

17 

18 
19 
20 
21 
22 
23 
24 
25 
26 



Using Algorithm DSteinerTreePacking, compute the (approximate) value Z\ to (3.3.2) for each 

message mi separately. 

z = minj I 1 ; scaling -factor = ^ 

q = scaling -factor ■ q 

7] = J E u;6 = (\e\/(l-nB))-^ 

while true do 



, Ve G e 



5 

= 1,...,N do 



f = 0; Z(e) = 
for phase i 
7 = 1 

while 7 > do 

Using Os Linear, compute Ui(l) for each weight vector Wi and corresponding partial scalar- 
linear solution Wi with the minimum cost. 
Using O FCo 



W = y{l)Wi + 

\w 



solve (3.4.3) to get the y(l), . . . ,y(k') values. 

• • + y(k')w k ^ 

e such that W(e) > c(e)| 



W 

7 = 7 - fy 

Update /(e) = 1(e) 
if £ e Z(e)c(e) > 1 then 



1 , W(e) 
l + T l^eJ 



Goto Line | 
end if 
end while 
t = t + l 
end for 

scaling -factor = Iscaling _f actor \ q 
end while 



2q 



return scaling -factor ■ tj log 1 _ 
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Analysis 

The corresponding dual linear program of (3.4.2) is 



min ^2 e l(e)c(e) 

s-t- Eee^( e ) J ( e )-E 3 ti[4 z (j) > 0. Vl<i<V,VW€Wi 

Ej=i Qj z U) > i 

l,z > 0. 



(3.4.4) 



We rewrite the dual linear program (3.4.4) as two recursively nested linear programs. Consider 



the following linear program derived from the dual program: 



max Yl k j=i Qj z U) 



s-t. EtiH^O') < Ui(l), Vl<i<k' 



(3.4.5) 



> 0, 



where Ui(l) = mhxweWi Ee W(e)l(e). Let D(l) = X) e /(e)c(e) and a(l) be the optimal value of 
the linear program ( 3.4.5[ ). Then, solving (3.4.4) is equivalent to finding an assignment of lengths 
to the edges, I : e 



D(l) 
a(l) 



so as to minimize r r7rf. Let f3 denote the optimal value of (3.4.4), i.e., 



min^ -^jjj- The dual linear program for (3.4.5) is 



min EiLlVW^iCO 

s - t - Yn=i[wi\jy{i) > qj, VI < j < 

y > o. 



(3.4.6) 



Let a'(y) 



£tiK]^(0 



and D[(y) = Yli=i y(i)Ui(l). Without loss of generality, we 



assume that an optimal solution to (3.4.6) satisfies y(j) < \qj~\ for all j; then (3.4.6) is equivalent 



to (3.4.3). Solving (3.4.6) is equivalent to finding an assignment of values to variables y, y : 
{!,..., k'} -> ™+ - - W 



, so as to minimize ( . Let /3' be the optimal value of (3.4.6), i.e., (3' 



mm 



D'i(y) 

V a'(y) 



a'(y) 

. By linear programming duality, a (I) = /?'. Then, 



T" a(l) 



mm 



mm 

I 



D(l) 



mm 



y a'(y) 
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First, we consider Algorithm OracleRayApprox_LCode with the infinite while-loop in line 6 
removed, with variable scaling -factor and line 24 removed, and with the finite for-loop in line 8 
replaced with an infinite while-loop. 

Theorem 49. Assume (3 > 1. For <oj <1, Algorithm OracleRayApproxJLCode, with modifica- 



tions as explained above, returns a (1 + oj)B- approximate solution to the linear program (3.4.2) in 

number of phases. 



at most 



V B 1U &1+?? 1-r/B 



Proof. Let denote the length function / and Jij—i the variable 7 at the start of the j-th 

iteration (of the while-loop in line 10) in phase i. Let Wij denote the fractional solution W 
computed in the j-th iteration in phase i. Let li$ be the length function at the start of phase i, 
or equivalently, at the end of phase i — 1. For simplicity, we denote D(li + \fl) and a(/j + i j o) by D(i) 
and a(i). Note that for each phase i and iteration j, 

D(k,j) = ^/(e)c(e) 

e 

= £>(iij-i) + V^2h, J -i(e)W lJ (e) 

e 

= £>(iy_i) + r/ J2 ki-i ( e ) (7i,i-i -7i,i) b(W + • ■ ■ + I/CAOWVle 

e 

< £)(Zij_i) + rj(li,j-i ~ 7i,j) 5 «(^,i-i) 

< + r/(7ij_i - 7 i j)BQ!(Zi + i ) o), 

where Wi, . . . , Wj/ and y(l), • • ■ , y(A;') are the partial scalar-linear solutions and variable y from 
line 11. Note that we used the fact that a is a nondecreasing function to obtain the last inequality. 
Summing up the above inequality over the iterations, we obtain 

D(l i+lfi ) < D(l i+1 _i) + 77(71+1 -1 - 7 i+ i,o)-Ba(Zi + i i0 ) 

< D(l i+ i- 2 ) + v(li+i-2 ~ ji+i,o)Ba(li+i,o) 

< D(l ifi )+r]Ba(l l+lfi ), 

where h+i-j (similarly, denote the length function I (variable 7) at the start of the j-th 
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from the last iteration in phase i. It follows that D(i) < D(i — 1) + rjBa(i). From here, the analysis 
is the same as Theorem HUl with B replacing A. □ 



While the running time of Algorithm OracleRayApproxJLCode depends on j3 by Theorem 49 
we can reduce/increase it by scaling the q vector /capacities appropriately. We now remove the 
assumption that f3 > 1 and analyze the running time of Algorithm OracleRayApproxJLCode as a 
whole. 

Theorem 50. For < u < 1, Algorithm OracleRayApprox_LCode computes a (1 + uj)B- approx- 



imate solution to the linear program Q3.4.2D in time 0(lo~ 2 (log A\n\ + |e|)51og|e| • (T FCo ver + 
k'Ts Linear)), where Tpcover is the time required to solve the fractional covering problem by Opcover 
within an approximation guarantee B and T$ Linear is the time required to solve the minimum cost 



scalar-linear network code problem exactly by Osl 



tnear • 



Proof. Let z* be the exact maximum fractional Steiner tree packing value for message m,j in line 1 
and let z* = mhij Note that z* is an upper bound on the maximum rate at which the demands 
can be satisfied by a simple fractional linear code solution. Since Z\ < z* < Az^, A- < /3 < Az. We 
scale the q vector and get 1 < /3 < A\fi\. The assumption /3 > 1 is satisfied, but now f3 could be as 
large as A\fi\. We employ the doubling trick with N = \^ log 1+7? ppg ] • Then, the total number 
of phases is at most N log A\fj,\. Each phase consists of variably many number of iterations of the 
while-loop in line 10 and, in all iterations except the last, we increase the length of an edge by a 
factor of 1 + 77. Hence, the total number of iterations exceeds the total number of phases by at most 
lei lo g 1+7? j. The total number of calls to the oracle Opcover is at most AHog A\/j\ + |e| log 1+?? |. For 
each iteration, we call the oracle Osnnear exactly k' times to compute the Ui(l) values and, hence, 
the total number of calls to the oracle Osunear is at most (NlogA\\x\ + |e| log 1+r? \)k' . From here, 



the proof follows Theorem 42 closely. □ 



3.4.5 Implementations of Oracles 0$Linear an d Opcover 

In this subsection, we discuss implementations of oracles Os Linear and Opc 
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Oracle OsLinear 

We only consider implementations of exact oracle Os Linear f° r the minimum cost scalar-linear 



network code problem (Definition 47). The assumption that the finite field F is fixed for the 
computation of semi-network linear coding capacity region is important; it ensures termination of 
algorithms for Os Linear-, given that the decidability of the linear coding problem without a fixed 
finite field is unknown at this time. 

A brute force approach is to loop through all possible subset of active edges and try all possible 



combinations of global linear coding vectors (see Section 2.1) on these edges in time 0(2^ \F\^^). 
For each edge e = (x,y), it takes 0{\u\I) time to check if the global coding vector on e is the span 
of global coding vectors on in-edges of x, where / is the maximum in-degree of any node. The total 
running time is 0(2l e l|i ?1 |l' i ll e l|/i||e|/) and is exponential in not only in |e|, but also in the number of 
messages, The brute force approach requires 0(|.F|' M " e ') space. 

We propose a faster algorithm of our own that solves the minimum cost scalar-linear network 
code problem using dynamic programming. First, we relabel nodes so that edges go from a lower- 
numbered node to a higher-numbered one and arrange the nodes in a line in order. This can be 
done as the network is acyclic and by a topological sort algorithm. Let ni, . . . , m v \ be the nodes in 
order. Second, we create states indexed by a triple (i, £j, fa), where 

1. % is an integer, 1 < i < \v\ — 1, 

2. ej is the set of edges that have the start node in {n\, . . . , n{\ and the end node in . . ., 
n\ u \}, and 

3. fa is the global coding vectors on edges in ej. 

In each state (i,€i,fa), we store the minimum cost scalar-linear solution, if it exists, with the 
set of global coding vectors fa on e« when the network is restricted to nodes m, . . . , n%. In essence, 
we consider the restricted network of nodes ni, . . . , n, and compute its minimum cost scalar-linear 
solution based on the minimum cost scalar-linear solutions found on the restricted network of nodes 
n\, . . . , rii—\. See Algorithm OracleSLinear_DP for more details. Note that valid -next state and 
validjprev state are linked lists. 

Theorem 51. Algorithm OracleSLinear_DP solves the minimum cost scalar-linear network code 
problem in time OdF^^lvl \n\& 2 ) and space 0(1^1'^'*), where = maxj |ej|. 
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Proof. Let <3? = maxj |ej|. Then, the number of states with a specific pair of i and e« is at most \F\ 1^1* 
and the time to compute the states of the form (i, e$, </>j) from the states of the form (i— 1, ej_i, </>i-i) 
takes OdFl 2 ^!*!^!^ 2 ), by simply looping pairs of the states (i, e», 0») and (i — l,ej_i,0i_i) and 
checking if the global coding vectors in fa follow from those in fa—\. As % ranges from 1 to 
\v\ — 1, the total running time of the algorithm is 0(|-F| $ 2 ). The total space required is 

0(|.F|H*) if we use the "sliding window" trick where we only keep states in two consecutive levels 
(corresponding to two consecutive values of i) at any time. □ 



Algorithm 7 Algorithm OracleSLinear_DP(A/', I) 

1: Sort nodes with a topological sort algorithm: ni,.. . , n\„u 

2: valid jnext state <— 0; valid jprevstate <— (0,0,0), the trivial empty solution. 

3: for i = 1, . . . , \u\ — 1 do 

4: Update valid-prev state so that only states that satisfy demands on rtj exist in the list. 
5: valid -next state <— NULL 

6: for each possible coding vector combination fa+i on e^+i do 

7: Find, if possible, a state in valid jprev state that leads to a minimum cost solution for 
(i + l,e i+ i,fa + i). 

8: If successful, add the state (i + 1, e^+i, fa+i) to valid .next state. 
9: end for 

10: valid jprev start <— valid-next state 
11: end for 

12: From valid jprev state, pick the minimum cost solution that satisfies all the demands at n\ u \. 
13: return the minimum cost solution found or "unsolvable" if no such solution exists. 



Clearly, the asymptotic behavior of Algorithm OracleSLinear_DP is much better than that 
of the brute force approach when $ <C |e| in the network. We summarize algorithms for oracle 
OsLinear with Table pT2j 

Table 3.2: Implementations of oracle OsLinear 



Algorithm for OsLinear 


Approximation Ratio 


Time 


Space 


BruteForce 

Algorithm OracleSLinear_DP 


1 
1 


0(2l £ l|F 
0(\F\ 2 ^ 


WN|/i||e|/) 
*|z/||/i|$ 2 ) 


0(|i?|MM) 
0(|F|H*) 



Oracle FCo ve 



The oracle O FCover solves the fractional covering problem (Definition 48 ) . We can solve the problem 
with a polynomial-time linear program solver such as the ellipsoid algorithm. In our problem of 
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networks, the number of variables of the linear program (3.4.3) is at most 2^' and the number of 



constraints is Hence, a polynomial-time linear program solver will give an exact solution in 
time polynomial in 2'^L As the ellipsoid algorithm could be slow in practice and could depend on 
2^1 poorly, the actual running time might not be practical. Fleischer proposed a combinatorial 
approximation algorithm for the covering problem that solves within an approximation ratio of 
(1 + oj) by using 0(u; -2 2' /1 ' log(c T u)) calls to an oracle that returns a most violated constraint 
(where uj > 0). Note that Fleischer's algorithm is proposed for nonnegative integer matrix A and 
nonnegative integer vectors b, c and u, but the algorithm still works for our formulation of the 
fractional covering problem. As there are \u\ constraints and at most 2'^' variables, the oracle for 
the most violated constraint can be implemented in time 0(2^l|/x|), and this leads to an algorithm 
that computes a (1 + w)-approximate solution in time 0(w _2 2 2 '^' log(c T ti)). We summarize with 
Table 

Table 3.3: Implementations of oracle Opcover 



Algorithm for O cover 


Approximation Ratio B 


Time 


Ellipsoid Algorithm 
Fleischer p] 


1 

l + OJ 


polynomial (2'^' , /i ) 
0{u!- 2 2 2 M\n\log(c T u)) 



3.5 Examples 

In this section, we provide examples of network capacity regions in the case of two messages. Instead 
of Algorithm FacetEnum_Route (or FacetEnum_LCode), we use Algorithm BoundaryTrace2D as the 
polytope reconstruction algorithm. Algorithm BoundaryTrace2D is similar to the algorithm given 
by Cole and Yap [5] in that the common main idea is that three collinear boundary points define a 
face. Algorithm BoundaryTrace2D is different from Cole and Yap [5] in that it considers rays that 
start from the origin. See the pseudocode for the details of Algorithm BoundaryTrace2D. Note C is 



a linked list. We refer to Appendix A. 2.1 for a proof of its correctness. The algorithm works both 
for C r and C[ with an appropriate oracle On ay . 

The example networks are given in Figure |3-1[ Nodes 1 and 2 are the source nodes and nodes 
5 and 6 are the receiver nodes. To compute the exact description of C r or C[, we hard-coded the 
corresponding linear programs and used a linear program solver, linprog, in MATLAB. As the 
networks are simple, it was easy to enumerate all Steiner trees and partial scalar-linear solutions, 



60 



Algorithm 8 Algorithm BoundaryTrace2D(AA, OR ay ) 



1: C<- 

2: Using Ojiay, compute the intersection points on x = ei ■ t, t > for i = 1, 2 and obtain (xi, yi) 
and (x 2 ,2/2). 

3: Insert (xi, —1), (a?i, yi),(x2, 2/2), and (—1,2/2) onto the head of £, in that order. 

4: cur pointer <— the head of £ 

5: while there exist three distinct points after cur -.pointer do 

6: Let ptl, pt2, pt3, and pt4 be the four consecutive points starting at curjpointer. 

7: Let line l\ go through ptl and pt2, and line I2 go through pi3 and ptA. 

8: if intersection point p exists between l\ and I2 then 

9: Using Onay, compute the boundary point r on x = pt, t > 0. 

10: if r 7^ pt2 and r 7^ pt3 then 

11: Insert point r into £ between p£2 and p£3. 

12: else 

13: Advance curjpointer. 

14: end if 

15: else 

16: Advance curjpointer. 

17: end if 

18: end while 

19: return £u(0,0), except (xi,— 1) and ( — 1 , 2/2) ■ 



and the corresponding linear programs were small. While this approach worked for the particular 
examples we present, it might not be suitable for bigger or more complicated networks. To get 
around the numerical issues, we used the tolerance of .05; for instance, two points whose corre- 
sponding coordinates differ by at most .05 are considered the same point. For the networks, we 
assume A = {0, 1}. For the semi- network linear coding capacity regions, we assume the finite field 
F is F 2 . To obtain approximate intersections points, we simply used an approximate oracle On ay 
in place of the linear program solver. We note that the approximate oracle On ay worked well with 
Algorithm BoundaryTrace2D and led to its successful termination for these networks, but this may 
not hold for arbitrary networks in general. 



For the network routing capacity region of network N± in Figure 3-2 we used uj = .5 in 
Algorithms OracleRayApprox_Route and DSteinerTreePacking for Onay an d the algorithm due 
to Charikar et al. [1] for Onsteiner with the approximation ratio A = 0(log 2 \u\). For the semi- 



network linear coding capacity region of network Mi in Figure 3-3, we used 00 = .5 in Algorithms 
DracleRayApprox_Route and DSteinerTreePacking for OR ay , the algorithm due to Fleischer |llj 
with the approximation ratio B = 1.1 for Opcover, an d Algorithm 0racleSLinear_DP for OsLinear- 
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Figure 3-1: The Butterfly network with unit edge capacities and its variants: JVj., A/2, and A/3. 
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Figure 3-2: Network routing capacity region 
C r of M 
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Figure 3-3: Semi- network linear coding ca- 
pacity region C[ of Ai , with respect to F2 



For the network routing capacity region of network A/2 in Figure 3-4 we used uj = .5 in 
Algorithms OracleRayApprox_Route and DSteinerTreePacking for Onay an d the algorithm due 
to Charikar et al. [I] for Opsteiner with the approximation ratio A = 0(log 2 \u\). For the semi- 



network linear coding capacity region of network A2 in Figure 3-5, we used u = .5 in Algorithms 
OracleRayApprox_Route and DSteinerTreePacking for OR ay , the algorithm due to Fleischer |llj 
with the approximation ratio B = 1.1 for Opcover^ an d Algorithm OracleSLinear_DP for OsLinear- 



For the network routing capacity region of network A3 in Figure 3-6 we used u = .5 in Algo- 
rithms OracleRayApprox_Route and DSteinerTreePacking for On ay and the brute force algorithm 
for Oosteiner with the approximation ratio A = 1. For the semi-network linear coding capacity 
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Figure 3-4: Network routing capacity region 
C r of Af 2 
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Figure 3-6: Network routing capacity region 
C r of A/3 
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Figure 3-5: Semi-network linear coding ca- 
pacity region C[ of A/2 > with respect to ¥2 




0.2 0.4 0.6 0. 



1 1.2 1.4 1.6 1.8 

Message ml 



Figure 3-7: Semi-network linear coding ca- 
pacity region C[ of A/3, with respect to F2 



region of network A/3 in Figure 3-7, we used uj = .9 in Algorithm OracleRayApprox_Route, u = .5 
in Algorithm DSteinerTreePacking, the algorithm due to Fleischer [IT] with the approximation 
ratio B = 1.1 for O fc oven and Algorithm 0racleSLinear_DP for OsLinear- 



3.6 Discussion and Conclusion 

In this chapter, we defined the network capacity region of networks analogously to the rate regions in 
information theory. In the case of the network routing capacity region, we showed that the region is 
a rational polytope and provided exact algorithms and approximation heuristics for computing the 
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polytope. In the case of the network linear coding capacity region, we defined an auxiliary polytope, 
the semi-network linear coding capacity region, that is a rational polytope and that inner bounds the 
network linear coding capacity region. We provided exact algorithms and approximation heuristics 
for computing the auxiliary polytope. We noted that the algorithms and heuristics presented in 
this chapter are not polynomial time schemes. 

Our results have a few straightforward extensions. We can design membership algorithms 
that given a rate vector, determines whether or not there exists a fractional network code solu- 
tion that achieves it from algorithms we provided, for the network routing capacity region and 
semi-network linear coding capacity region. Also, we can compute corresponding approximate so- 



lutions (x(T))t£T f° r linear program (3.3.1) by storing counters for Steiner trees T in Algorithm 
OracleRayApprox_Route; the same is true for (x(W))weW- 

While our results apply to the networks defined on directed acyclic multigraphs, they generalize 
to directed networks with cycles and undirected networks straightforwardly. In the computation 
of the network routing capacity region, we consider minimal fractional routing solutions that can 
be decomposed into a set of Steiner trees defined appropriately for directed networks with cycles 
(or undirected networks) and modify the algorithms correspondingly. In the computation of the 
semi-network linear coding capacity region, we consider simple fractional linear coding solutions 
that can be decomposed into a set of partial scalar-linear solutions defined appropriately for the 
networks and modify the algorithms correspondingly. 

In connection to Cannons et al. [2], our work essentially addresses a few problems proposed 
by Cannons et al.: whether there exists an efficient algorithm for computing the network routing 
capacity and whether there exists an algorithm for computing the network linear coding capacity. 
It follows from our work that there exist combinatorial approximation algorithms for computing the 
network routing capacity and for computing a lower bound of the network linear coding capacity. 

We conclude with a few open problems related to our work: determine how good of an inner 
bound the semi-network linear coding capacity region is to the network linear coding capacity 
region; design an efficient algorithm, if possible, for computing the linear coding capacity region 
and network capacity region; design an efficient algorithm, if possible, for the minimum cost scalar- 
linear network code problem. 
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Appendix A 



Computations and Proofs 



A.l Computations 



A. 1.1 Computation in Theorem 39 



From Section 2 of Garg and Konemann [13] and the fact that oracle Onsteiner is an approximate 
oracle with the approximation guarantee of A, it follows that the ratio of dual optimal and primal 
feasible solutions, (, satisfies 

(<(l- V )- 2 A. 

We choose r/ appropriately to make sure that the £ < (1 + (J) A. It suffices to choose r/ such that 
(1 — ij)~ 2 < 1 + oj, or equivalently, (1 — r/)^ 1 < (1 + oj) 1 / 2 . By the Taylor Series Theorem, for 
< oj < 1, we have 



1 + \oj- 1 -oj 2 <{1 + oj) 1 ' 2 . 



Note that for < 7] < \, 



(1 - r])- 1 = l + r] + irf + ... = l + Vt~^— < 1 + 27?. 

1 — 7] 

Then, for r\ = ^cj, we have that < uj < 1 implies < r] < g and that 

(1 - 7])- 1 < 1 + 277 < 1 + < 1 + -u - -oj 2 < (1 + oj) 1/2 . 

8 2 8 
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A. 1.2 Computation in Theorem [42 



We want to choose r\ appropriately so that (1 — rjA) 3 < (1 + w), or equivalently, that (1 — r/A) 1 < 
(1 + W) 1 / 3 . By the Taylor Series Theorem, for < ui < 1, we have 



Note that for < rjA < §, 



(1 - r/A)" 1 = 1 + r\A + (??^) 2 + . . . = 1 + r]A 1 < 1 + 2r/A. 

1 — rjA 

Then, for r\ = g^w, we have that < ui < 1 implies < r\A < ^ and that 

(1 - V A)- 1 < 1 + 2ijA < 1 + \u < 1 + \oj - \u 2 < (1 + u) 1 ^. 

9 o 9 

A. 2 Proofs 

A. 2.1 Proof for Algorithm BoundaryTrace2D 

Without loss of generality, we prove that Algorithm BoundaryTrace2D is correct for the computa- 
tion of the network routing capacity region C r . Note that £ is a linked list of computed boundary 
points (and the two auxiliary points (x\,— 1) and (—1,2/2)) ordered clockwise. When two lines 
intersect, we mean that the lines intersect in exactly one point. When a line goes through a line 
segment, we mean that the line intersects the line segment in exactly one point. 

Theorem 52. Algorithm BoundaryTrace2D calls the oracle On ay 0{n) times where n is the number 
of edges in the polygon C r . 

Proof. We first show that, for each edge e of C r , the number of distinct boundary points computed in 
the interior of the edge (excluding the vertices) is at most 3 throughout the execution of Algorithm 
BoundaryTrace2D. Let edge e have 3 distinct boundary points computed in its interior: pi, P2, and 
Pz, ordered clockwise. Then, any 4 consecutive boundary points in C cannot produce a ray in line 
9 that goes through the interior of edge e. If the set of 4 consecutive points contains at most 2 of 
Pi, P2, and P3, then, clearly, the ray does not go through the interior of e. If the set of 4 consecutive 
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points contains all 3 points p±,P2, and p% and a point before p\ in C, then the intersection point p 
in line 8 is exactly pt2 and no new boundary point is created. Similarly, it can be shown in other 
remaining cases that no new boundary point is introduced. 

Note that, for each boundary point b in £, there can be at most 3 calls to oracle OR ay associated 
with it; a call to compute the boundary point for the first time, a call if b appears as pt2 in line 6 
and as the boundary point r in line 9, and a call if b appears as pt3 in line 6 and as the boundary 
point r in line 9. As there are n — 2 edges and n — 1 vertices on the outer boundary of C r and 0(1) 
calls to ORay for each boundary point computed, the statement follows. □ 

The correctness of BoundaryTrace2D follows from the fact that each vertex on the outer bound- 
ary of C r is computed by oracle OR ay and that after enough boundary points have been computed, 
the cur. pointer in the algorithm will advance to termination. It is easy to see that all vertices of the 
polygon C r are included in the returned list C. Assume a vertex v is missed in C and the algorithm 
terminated successfully. Assume that ptl, pt2, pt3 and pt4 are the four consecutive points in the 
resulting list such that the line segment between the origin and v and the line segment between 
pt2 and pt3 intersect. Note that pt2 and pt3 are on different edges of C r . Then, it is easy to see 
that we have the ray in line 9 going through the interior of the line segment between pt2 and pt3 
and, hence, a new boundary point would have been added. Therefore, cur -pointer should not have 
advanced to termination, and this contradicts that the vertex v is missing from C. 
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